* [Qemu-devel] vhost-user slave deadlock question
@ 2019-03-01 19:05 Dr. David Alan Gilbert
0 siblings, 0 replies; only message in thread
From: Dr. David Alan Gilbert @ 2019-03-01 19:05 UTC (permalink / raw)
To: qemu-devel, tiwei.bie, marcandre.lureau; +Cc: stefanha, maxime.coquelin
Hi,
I've added a few commands to vhost-user for virtio-fs and
am hitting a deadlock and am trying to figure out what the correct fix
is; suggestions welcome.
My setup is:
Messages sent over the virtio queues can cause the daemon to need
to send a request back to qemu along the slave, and qemu must respond with an
OK/error. Lets call this command 'setupmapping'.
In my case I'm reading vhost-user commands in one thread and
processing the queues in another.
That normally works OK
My problem:
If qemu crashes or quits it stops the queues synchronously at a point
when the main loop in qemu wont respond to anything else. However
if we're unlucky the daemon has already sent a message to qemu and
is waiting for the response; but that response can't arrive because
qemu is shutting down. So the queue shutdown request never completes.
Then if I kill the daemon forcibly, qemu's handler for the slavefd
wakes up and tries to read data - but it's device has gone and it
crashes.
The trace is:
(Where vuf_* is my device and the structure is pretty much the same
as the others).
vm_state_notify->virtio_set_status->vuf_set_status->vuf_stop->vhost_dev_stop->vhost_virtqueue_stop->vhost_user_get_vring_base->vhost_user_read
So it feels like we need to shut down the slave FD when we shut
down the device; but it's not clear to me at what level.
In some ways it feels like we need a way to get out if this
hole even if we shut down one queue synchronously.
Is anyone fighting similar cases?
Dave
--
Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2019-03-01 19:05 UTC | newest]
Thread overview: (only message) (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2019-03-01 19:05 [Qemu-devel] vhost-user slave deadlock question Dr. David Alan Gilbert
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).