From: Stefan Hajnoczi <stefanha@redhat.com>
To: "Philippe Mathieu-Daudé" <philmd@oss.qualcomm.com>
Cc: qemu-devel@nongnu.org, "Paolo Bonzini" <pbonzini@redhat.com>,
"Markus Armbruster" <armbru@redhat.com>,
"Igor Mammedov" <imammedo@redhat.com>,
"Fam Zheng" <fam@euphon.net>,
"Laurent Vivier" <lvivier@redhat.com>,
"Thomas Huth" <thuth@redhat.com>,
"Daniel P. Berrangé" <berrange@redhat.com>,
"Eduardo Habkost" <ehabkost@redhat.com>,
qemu-block@nongnu.org, "Maxim Levitsky" <mlevitsk@redhat.com>,
"Michael S. Tsirkin" <mst@redhat.com>,
"Jason Wang" <jasowang@redhat.com>,
"John Snow" <jsnow@redhat.com>,
"Stefan Berger" <stefanb@linux.ibm.com>,
"Pierrick Bouvier" <pierrick.bouvier@oss.qualcomm.com>,
"Mark Cave-Ayland" <mark.cave-ayland@ilande.co.uk>
Subject: Re: [PATCH v7 09/13] device-core: use atomic_set on .realized property
Date: Thu, 18 Jun 2026 10:55:31 -0400 [thread overview]
Message-ID: <20260618145531.GB720244@fedora> (raw)
In-Reply-To: <f67389ae-4b0e-46f5-a122-d83464f192f4@oss.qualcomm.com>
[-- Attachment #1: Type: text/plain, Size: 5738 bytes --]
On Thu, Jun 18, 2026 at 11:39:11AM +0200, Philippe Mathieu-Daudé wrote:
> Hi,
>
> Old patch committed as a23151e8cc8 (Oct 2020).
> I'm revisiting it in the context of dynamic heterogeneous machines.
>
> On 6/10/20 14:39, Maxim Levitsky wrote:
> > Some code might race with placement of new devices on a bus.
>
> So IIUC at least 2 vCPUs plug distinct devices on the same bus,
> and the problem is the bus slot assigned for the device. What is
> deciding the free slot, the guest code or QEMU?
Hi Philippe,
The issue was not about a race in slot assignment, it was about hotplug
racing with an IOThread. When the guest triggers device emulation
activity in an IOThread and the monitor thread is doing hotplug, then
there was a race condition since the IOThread does not execute under the
Big QEMU Lock.
In other words, the virtio-scsi device model must not see partially
hotplugged/unplugged devices when processing SCSI requests in an
IOThread.
>
> > We currently first place a (unrealized) device on the bus
> > and then realize it.
>
> Sound like ill design. Maybe resulting from the unclear definition
> on what "realized" means. "Guest visible"? "Wired on some bus"?
Back when everything ran inside the BQL it worked fine because device
emulation activity would not happen during hotplug. Hotplug was atomic.
IOThreads broke the BQL assumption and that's why the race condition
occurred.
>
> Something I tried to clarify once (now outdated):
> https://lore.kernel.org/qemu-devel/20240209123226.32576-1-philmd@linaro.org/
>
> A vCPU thread shouldn't poke at QOM/QDev internals. When it accesses
> a device (to check "is it out of reset?"), this is already existing
> realized.
>
> Actually, when the guest asks to assign the device to the bus, the
> device is already existing (from guest PoV), so must be realized.
I'm not sure I understand. The guest doesn't ask to assign a device to a
bus. QEMU assigns devices to busses. (I'm ignoring cooperative hotplug
and hot unplug workflows where the device and the driver communicate in
a multi-step process.)
>
> > As a workaround, users that scan the child device list, can
> > check the realized property to see if it is safe to access such a device.
> > Use an atomic write here too to aid with this.
>
> I'd appreciate to be pointed at real use case we fixed here, to
> better figure what could be reworked on the QBus / QDev layer in
> order to not depend anymore on this workaround.
SCSI controller emulation involves iterating over the SCSI devices on
the bus. This can run in an IOThread outside the BQL. Hotplug modifies
the bus and this needs to be done in a thread-safe way. Otherwise the
IOThread seem partially hotplugged/hot unplugged devices on the SCSI
bus.
This patch series provided a way for devices (e.g. virtio-scsi) that
face this problem to safely skip devices on their busses that are not
realized.
> > A separate discussion is what to do with devices that are unrealized:
> > It looks like for this case we only call the hotplug handler's unplug
> > callback and its up to it to unrealize the device.
>
> Likely the hotplugging path is what needs to be improved (I'm seeing
> a similar problem with vCPU hotplug).
>
> Thanks,
>
> Phil.
>
> > An atomic operation doesn't cause harm for this code path though.
> >
> > Signed-off-by: Maxim Levitsky <mlevitsk@redhat.com>
> > Reviewed-by: Stefan Hajnoczi <stefanha@redhat.com>
> > Message-Id: <20200913160259.32145-6-mlevitsk@redhat.com>
> > Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
> > ---
> > hw/core/qdev.c | 19 ++++++++++++++++++-
> > include/hw/qdev-core.h | 2 ++
> > 2 files changed, 20 insertions(+), 1 deletion(-)
> >
> > diff --git a/hw/core/qdev.c b/hw/core/qdev.c
> > index 59e5e710b7..fc4daa36fa 100644
> > --- a/hw/core/qdev.c
> > +++ b/hw/core/qdev.c
> > @@ -946,7 +946,25 @@ static void device_set_realized(Object *obj, bool value, Error **errp)
> > }
> > }
> > + qatomic_store_release(&dev->realized, value);
> > +
> > } else if (!value && dev->realized) {
> > +
> > + /*
> > + * Change the value so that any concurrent users are aware
> > + * that the device is going to be unrealized
> > + *
> > + * TODO: change .realized property to enum that states
> > + * each phase of the device realization/unrealization
> > + */
> > +
> > + qatomic_set(&dev->realized, value);
> > + /*
> > + * Ensure that concurrent users see this update prior to
> > + * any other changes done by unrealize.
> > + */
> > + smp_wmb();
> > +
> > QLIST_FOREACH(bus, &dev->child_bus, sibling) {
> > qbus_unrealize(bus);
> > }
> > @@ -961,7 +979,6 @@ static void device_set_realized(Object *obj, bool value, Error **errp)
> > }
> > assert(local_err == NULL);
> > - dev->realized = value;
> > return;
> > child_realize_fail:
> > diff --git a/include/hw/qdev-core.h b/include/hw/qdev-core.h
> > index 2c6307e3ed..868973319e 100644
> > --- a/include/hw/qdev-core.h
> > +++ b/include/hw/qdev-core.h
> > @@ -163,6 +163,8 @@ struct NamedClockList {
> > /**
> > * DeviceState:
> > * @realized: Indicates whether the device has been fully constructed.
> > + * When accessed outsize big qemu lock, must be accessed with
> > + * atomic_load_acquire()
> > * @reset: ResettableState for the device; handled by Resettable interface.
> > *
> > * This structure should not be accessed directly. We declare it here
>
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 488 bytes --]
next prev parent reply other threads:[~2026-06-18 14:56 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-10-06 12:38 [PATCH v7 00/13] Fix scsi devices plug/unplug races w.r.t virtio-scsi iothread Maxim Levitsky
2020-10-06 12:38 ` [PATCH v7 01/13] qtest: rename qtest_qmp_receive to qtest_qmp_receive_dict Maxim Levitsky
2020-10-12 11:13 ` Thomas Huth
2020-10-06 12:38 ` [PATCH v7 02/13] qtest: Reintroduce qtest_qmp_receive Maxim Levitsky
2020-10-12 11:14 ` Thomas Huth
2020-10-12 13:47 ` Paolo Bonzini
2020-10-12 13:49 ` Thomas Huth
2020-10-12 16:56 ` Paolo Bonzini
2020-10-06 12:38 ` [PATCH v7 03/13] qtest: switch users back to qtest_qmp_receive Maxim Levitsky
2020-10-06 12:56 ` Paolo Bonzini
2020-10-06 13:17 ` Maxim Levitsky
2020-10-06 12:38 ` [PATCH v7 04/13] qdev: add "check if address free" callback for buses Maxim Levitsky
2020-10-06 12:38 ` [PATCH v7 05/13] scsi: switch to bus->check_address Maxim Levitsky
2020-10-06 12:38 ` [PATCH v7 06/13] scsi/scsi_bus: switch search direction in scsi_device_find Maxim Levitsky
2020-10-06 12:38 ` [PATCH v7 07/13] device_core: use drain_call_rcu in in qmp_device_add Maxim Levitsky
2020-10-06 12:38 ` [PATCH v7 08/13] device-core: use RCU for list of children of a bus Maxim Levitsky
2020-10-06 12:39 ` [PATCH v7 09/13] device-core: use atomic_set on .realized property Maxim Levitsky
2026-06-18 9:39 ` Philippe Mathieu-Daudé
2026-06-18 14:55 ` Stefan Hajnoczi [this message]
2020-10-06 12:39 ` [PATCH v7 10/13] scsi/scsi-bus: scsi_device_find: don't return unrealized devices Maxim Levitsky
2020-10-06 12:39 ` [PATCH v7 11/13] scsi/scsi_bus: Add scsi_device_get Maxim Levitsky
2020-10-06 12:39 ` [PATCH v7 12/13] virtio-scsi: use scsi_device_get Maxim Levitsky
2020-10-06 12:39 ` [PATCH v7 13/13] scsi/scsi_bus: fix races in REPORT LUNS Maxim Levitsky
2020-10-06 13:01 ` [PATCH v7 00/13] Fix scsi devices plug/unplug races w.r.t virtio-scsi iothread no-reply
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260618145531.GB720244@fedora \
--to=stefanha@redhat.com \
--cc=armbru@redhat.com \
--cc=berrange@redhat.com \
--cc=ehabkost@redhat.com \
--cc=fam@euphon.net \
--cc=imammedo@redhat.com \
--cc=jasowang@redhat.com \
--cc=jsnow@redhat.com \
--cc=lvivier@redhat.com \
--cc=mark.cave-ayland@ilande.co.uk \
--cc=mlevitsk@redhat.com \
--cc=mst@redhat.com \
--cc=pbonzini@redhat.com \
--cc=philmd@oss.qualcomm.com \
--cc=pierrick.bouvier@oss.qualcomm.com \
--cc=qemu-block@nongnu.org \
--cc=qemu-devel@nongnu.org \
--cc=stefanb@linux.ibm.com \
--cc=thuth@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.