qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: "Alex Bennée" <alex.bennee@linaro.org>
To: qemu-devel@nongnu.org
Cc: "Pierrick Bouvier" <pierrick.bouvier@linaro.org>,
	"Alex Bennée" <alex.bennee@linaro.org>,
	"Thomas Huth" <thuth@redhat.com>,
	"Paolo Bonzini" <pbonzini@redhat.com>,
	"Akihiko Odaki" <akihiko.odaki@daynix.com>,
	"John Snow" <jsnow@redhat.com>, "Fabiano Rosas" <farosas@suse.de>,
	"Peter Xu" <peterx@redhat.com>,
	"Marc-André Lureau" <marcandre.lureau@redhat.com>,
	"Alexandre Iooss" <erdnaxe@crans.org>,
	"Markus Armbruster" <armbru@redhat.com>,
	"David Hildenbrand" <david@redhat.com>,
	"Laurent Vivier" <lvivier@redhat.com>,
	"Daniel P. Berrangé" <berrange@redhat.com>,
	"Peter Maydell" <peter.maydell@linaro.org>,
	qemu-arm@nongnu.org, "Philippe Mathieu-Daudé" <philmd@linaro.org>,
	"Mahmoud Mandour" <ma.mandourr@gmail.com>,
	"Sriram Yagnaraman" <sriram.yagnaraman@ericsson.com>,
	"Dmitry Osipenko" <dmitry.osipenko@collabora.com>,
	"Gustavo Romero" <gustavo.romero@linaro.org>,
	"Michael S. Tsirkin" <mst@redhat.com>,
	"Manos Pitsidianakis" <manos.pitsidianakis@linaro.org>,
	qemu-stable@nongnu.org
Subject: [PATCH v3 12/20] virtio-gpu: fix hang under TCG when unmapping blob
Date: Wed, 21 May 2025 17:42:42 +0100	[thread overview]
Message-ID: <20250521164250.135776-13-alex.bennee@linaro.org> (raw)
In-Reply-To: <20250521164250.135776-1-alex.bennee@linaro.org>

From: Manos Pitsidianakis <manos.pitsidianakis@linaro.org>

This commit fixes an indefinite hang when using VIRTIO GPU blob objects
under TCG in certain conditions.

The VIRTIO_GPU_CMD_RESOURCE_MAP_BLOB VIRTIO command creates a
MemoryRegion and attaches it to an offset on a PCI BAR of the
VirtIOGPUdevice. The VIRTIO_GPU_CMD_RESOURCE_UNMAP_BLOB command unmaps
it.

Because virglrenderer commands are not thread-safe they are only
called on the main context and QEMU performs the cleanup in three steps
to prevent a use-after-free scenario where the guest can access the
region after it’s unmapped:

1. From the main context, the region’s field finish_unmapping is false
   by default, so it sets a variable cmd_suspended, increases the
   renderer_blocked variable, deletes the blob subregion, and unparents
   the blob subregion causing its reference count to decrement.

2. From an RCU context, the MemoryView gets freed, the FlatView gets
   recalculated, the free callback of the blob region
   virtio_gpu_virgl_hostmem_region_free is called which sets the
   region’s field finish_unmapping to true, allowing the main thread
   context to finish replying to the command

3. From the main context, the command is processed again, but this time
   finish_unmapping is true, so virgl_renderer_resource_unmap can be
   called and a response is sent to the guest.

It happens so that under TCG, if the guest has no timers configured (and
thus no interrupt will cause the CPU to exit), the RCU thread does not
have enough time to grab the locks and recalculate the FlatView.

That’s not a big problem in practice since most guests will assume a
response will happen later in time and go on to do different things,
potentially triggering interrupts and allowing the RCU context to run.
If the guest waits for the unmap command to complete though, it blocks
indefinitely. Attaching to the QEMU monitor and force quitting the guest
allows the cleanup to continue.

There's no reason why the FlatView recalculation can't occur right away
when we delete the blob subregion, however. It does not, because when we
create the subregion we set the object as its own parent:

    memory_region_init_ram_ptr(mr, OBJECT(mr), "blob", size, data);

The extra reference is what prevents freeing the memory region object in
the memory transaction of deleting the subregion.

This commit changes the owner object to the device, which removes the
extra owner reference in the memory region and causes the MR to be
freed right away in the main context.

Acked-by: Michael S. Tsirkin <mst@redhat.com>
Signed-off-by: Manos Pitsidianakis <manos.pitsidianakis@linaro.org>
Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
Tested-by: Alex Bennée <alex.bennee@linaro.org>
Message-Id: <20250410122643.1747913-3-manos.pitsidianakis@linaro.org>
Cc: qemu-stable@nongnu.org
---
 hw/display/virtio-gpu-virgl.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/hw/display/virtio-gpu-virgl.c b/hw/display/virtio-gpu-virgl.c
index 71a7500de9..8fbe4e70cc 100644
--- a/hw/display/virtio-gpu-virgl.c
+++ b/hw/display/virtio-gpu-virgl.c
@@ -112,7 +112,7 @@ virtio_gpu_virgl_map_resource_blob(VirtIOGPU *g,
     vmr->g = g;
     mr = g_new0(MemoryRegion, 1);
 
-    memory_region_init_ram_ptr(mr, OBJECT(mr), "blob", size, data);
+    memory_region_init_ram_ptr(mr, OBJECT(g), "blob", size, data);
     memory_region_add_subregion(&b->hostmem, offset, mr);
     memory_region_set_enabled(mr, true);
 
-- 
2.39.5



  parent reply	other threads:[~2025-05-21 16:44 UTC|newest]

Thread overview: 52+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-05-21 16:42 [PATCH v3 00/20] Maintainer updates for May (testing, plugins, virtio-gpu) - pre-PR Alex Bennée
2025-05-21 16:42 ` [PATCH v3 01/20] tests/docker: expose $HOME/.cache/qemu as docker volume Alex Bennée
2025-05-22  5:23   ` Thomas Huth
2025-05-21 16:42 ` [PATCH v3 02/20] gitlab: disable debug info on CI builds Alex Bennée
2025-05-21 16:42 ` [PATCH v3 03/20] tests/tcg: make aarch64 boot.S handle different starting modes Alex Bennée
2025-05-22  5:10   ` Akihiko Odaki
2025-05-21 16:42 ` [PATCH v3 04/20] tests/qtest: fix igb test failure under --enable-ubsan Alex Bennée
2025-05-22  5:19   ` Akihiko Odaki
2025-05-25 15:18     ` Philippe Mathieu-Daudé
2025-05-21 16:42 ` [PATCH v3 05/20] tests/Makefile: include test-plugins in per-arch build deps Alex Bennée
2025-05-22  5:37   ` Akihiko Odaki
2025-05-22 10:31     ` Alex Bennée
2025-05-22 10:35       ` Akihiko Odaki
2025-05-25 15:18         ` Philippe Mathieu-Daudé
2025-05-21 16:42 ` [PATCH v3 06/20] tests/functional: Add PCI hotplug test for aarch64 Alex Bennée
2025-05-22  5:43   ` Akihiko Odaki
2025-05-21 16:42 ` [PATCH v3 07/20] contrib/plugins: add a scaling factor to the ips arg Alex Bennée
2025-05-22  5:45   ` Akihiko Odaki
2025-05-25 15:19     ` Philippe Mathieu-Daudé
2025-05-21 16:42 ` [PATCH v3 08/20] contrib/plugins: allow setting of instructions per quantum Alex Bennée
2025-05-22  5:55   ` Akihiko Odaki
2025-05-21 16:42 ` [PATCH v3 09/20] MAINTAINERS: add myself to virtio-gpu for Odd Fixes Alex Bennée
2025-05-22  5:16   ` Markus Armbruster
2025-05-21 16:42 ` [PATCH v3 10/20] MAINTAINERS: add Akihiko and Dmitry as reviewers Alex Bennée
2025-05-22  5:16   ` Markus Armbruster
2025-05-24 11:07   ` Michael S. Tsirkin
2025-05-25 15:20   ` Philippe Mathieu-Daudé
2025-05-21 16:42 ` [PATCH v3 11/20] hw/display: re-arrange memory region tracking Alex Bennée
2025-05-21 16:42 ` Alex Bennée [this message]
2025-05-22  5:59   ` [PATCH v3 12/20] virtio-gpu: fix hang under TCG when unmapping blob Akihiko Odaki
2025-05-22  6:45     ` Alex Bennée
2025-05-22  7:02       ` Akihiko Odaki
2025-05-22  7:31         ` Manos Pitsidianakis
2025-05-22  7:40           ` Akihiko Odaki
2025-05-22  9:28             ` Alex Bennée
2025-05-22  9:54               ` Akihiko Odaki
2025-05-27 10:05                 ` Alex Bennée
2025-05-27 11:03                   ` Akihiko Odaki
2025-05-21 16:42 ` [PATCH v3 13/20] virtio-gpu: refactor async blob unmapping Alex Bennée
2025-05-21 16:42 ` [PATCH v3 14/20] ui/gtk-gl-area: Remove extra draw call in refresh Alex Bennée
2025-05-21 16:42 ` [PATCH v3 15/20] virtio-gpu: support context init multiple timeline Alex Bennée
2025-05-21 16:42 ` [PATCH v3 16/20] include/exec: fix assert in size_memop Alex Bennée
2025-05-22  6:07   ` Akihiko Odaki
2025-05-21 16:42 ` [PATCH v3 17/20] include/gdbstub: fix include guard in commands.h Alex Bennée
2025-05-21 16:42 ` [PATCH v3 18/20] gdbstub: assert earlier in handle_read_all_regs Alex Bennée
2025-05-21 16:42 ` [PATCH v3 19/20] gdbstub: Implement qGDBServerVersion packet Alex Bennée
2025-05-22  6:29   ` Akihiko Odaki
2025-05-22 10:05     ` Alex Bennée
2025-05-22 10:15       ` Akihiko Odaki
2025-05-25 15:25   ` Philippe Mathieu-Daudé
2025-05-27  9:16     ` Alex Bennée
2025-05-21 16:42 ` [PATCH v3 20/20] gdbstub: update aarch64-core.xml Alex Bennée

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20250521164250.135776-13-alex.bennee@linaro.org \
    --to=alex.bennee@linaro.org \
    --cc=akihiko.odaki@daynix.com \
    --cc=armbru@redhat.com \
    --cc=berrange@redhat.com \
    --cc=david@redhat.com \
    --cc=dmitry.osipenko@collabora.com \
    --cc=erdnaxe@crans.org \
    --cc=farosas@suse.de \
    --cc=gustavo.romero@linaro.org \
    --cc=jsnow@redhat.com \
    --cc=lvivier@redhat.com \
    --cc=ma.mandourr@gmail.com \
    --cc=manos.pitsidianakis@linaro.org \
    --cc=marcandre.lureau@redhat.com \
    --cc=mst@redhat.com \
    --cc=pbonzini@redhat.com \
    --cc=peter.maydell@linaro.org \
    --cc=peterx@redhat.com \
    --cc=philmd@linaro.org \
    --cc=pierrick.bouvier@linaro.org \
    --cc=qemu-arm@nongnu.org \
    --cc=qemu-devel@nongnu.org \
    --cc=qemu-stable@nongnu.org \
    --cc=sriram.yagnaraman@ericsson.com \
    --cc=thuth@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).