From: tu bo <tubo@linux.vnet.ibm.com>
To: Fam Zheng <famz@redhat.com>, Paolo Bonzini <pbonzini@redhat.com>
Cc: Christian Borntraeger <borntraeger@de.ibm.com>, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH 5/8] virtio-blk: fix "disabled data plane" mode
Date: Wed, 16 Mar 2016 13:21:54 +0800 [thread overview]
Message-ID: <56E8ED72.4030004@linux.vnet.ibm.com> (raw)
In-Reply-To: <20160315124530.GA10718@ad.usersys.redhat.com>
On 03/15/2016 08:45 PM, Fam Zheng wrote:
> On Fri, 03/11 11:28, Paolo Bonzini wrote:
>>
>>
>> On 10/03/2016 10:40, Christian Borntraeger wrote:
>>> On 03/10/2016 10:03 AM, Christian Borntraeger wrote:
>>>> On 03/10/2016 02:51 AM, Fam Zheng wrote:
>>>> [...]
>>>>> The aio_poll() inside "blk_set_aio_context(s->conf->conf.blk, s->ctx)" looks
>>>>> suspicious:
>>>>>
>>>>> main thread iothread
>>>>> ----------------------------------------------------------------------------
>>>>> virtio_blk_handle_output()
>>>>> virtio_blk_data_plane_start()
>>>>> vblk->dataplane_started = true;
>>>>> blk_set_aio_context()
>>>>> bdrv_set_aio_context()
>>>>> bdrv_drain()
>>>>> aio_poll()
>>>>> <snip...>
>>>>> virtio_blk_handle_output()
>>>>> /* s->dataplane_started is true */
>>>>> !!! -> virtio_blk_handle_request()
>>>>> event_notifier_set(ioeventfd)
>>>>> aio_poll()
>>>>> virtio_blk_handle_request()
>>>>>
>>>>> Christian, could you try the followed patch? The aio_poll above is replaced
>>>>> with a "limited aio_poll" that doesn't disptach ioeventfd.
>>>>>
>>>>> (Note: perhaps moving "vblk->dataplane_started = true;" after
>>>>> blk_set_aio_context() also *works around* this.)
>>>>>
>>>>> ---
>>>>>
>>>>> diff --git a/block.c b/block.c
>>>>> index ba24b8e..e37e8f7 100644
>>>>> --- a/block.c
>>>>> +++ b/block.c
>>>>> @@ -4093,7 +4093,9 @@ void bdrv_attach_aio_context(BlockDriverState *bs,
>>>>>
>>>>> void bdrv_set_aio_context(BlockDriverState *bs, AioContext *new_context)
>>>>> {
>>>>> - bdrv_drain(bs); /* ensure there are no in-flight requests */
>>>>> + /* ensure there are no in-flight requests */
>>>>> + bdrv_drained_begin(bs);
>>>>> + bdrv_drained_end(bs);
>>>>>
>>>>> bdrv_detach_aio_context(bs);
>>>>>
>>>>
>>>> That seems to do the trick.
>>>
>>> Or not. Crashed again :-(
>>
>> I would put bdrv_drained_end just before aio_context_release.
>
> This won't work. bdrv_drained_end must be called with the same ctx as
> bdrv_drained_begin, which is only true before bdrv_detach_aio_context().
>
>>
>> But secondarily, I'm thinking of making the logic simpler to understand
>> in two ways:
>>
>> 1) adding a mutex around virtio_blk_data_plane_start/stop.
>>
>> 2) moving
>>
>> event_notifier_set(virtio_queue_get_host_notifier(s->vq));
>> virtio_queue_aio_set_host_notifier_handler(s->vq, s->ctx, true, true);
>>
>> to a bottom half (created with aio_bh_new in s->ctx). The bottom half
>> takes the mutex, checks again "if (vblk->dataplane_started)" and if it's
>> true starts the processing.
>
> Like this? If it captures your idea, could Bo or Christian help test?
>
>
With this patch, I still can get qemu crash as before,
(gdb) bt
#0 bdrv_co_do_rw (opaque=0x0) at block/io.c:2172
#1 0x000002aa17f5a4a6 in coroutine_trampoline (i0=<optimized out>,
i1=-1677707808) at util/coroutine-ucontext.c:79
#2 0x000003ffac25150a in __makecontext_ret () from /lib64/libc.so.6
Good news is that frequency of qemu crash is much less that before.
---
>
> From b5b8886693828d498ee184fc7d4e13d8c06cdf39 Mon Sep 17 00:00:00 2001
> From: Fam Zheng <famz@redhat.com>
> Date: Thu, 10 Mar 2016 10:26:36 +0800
> Subject: [PATCH] virtio-blk dataplane start crash fix
>
> Suggested-by: Paolo Bonzini <pbonzini@redhat.com>
> Signed-off-by: Fam Zheng <famz@redhat.com>
> ---
> block.c | 4 +++-
> hw/block/dataplane/virtio-blk.c | 39 ++++++++++++++++++++++++++++++++-------
> 2 files changed, 35 insertions(+), 8 deletions(-)
>
> diff --git a/block.c b/block.c
> index ba24b8e..e37e8f7 100644
> --- a/block.c
> +++ b/block.c
> @@ -4093,7 +4093,9 @@ void bdrv_attach_aio_context(BlockDriverState *bs,
>
> void bdrv_set_aio_context(BlockDriverState *bs, AioContext *new_context)
> {
> - bdrv_drain(bs); /* ensure there are no in-flight requests */
> + /* ensure there are no in-flight requests */
> + bdrv_drained_begin(bs);
> + bdrv_drained_end(bs);
>
> bdrv_detach_aio_context(bs);
>
> diff --git a/hw/block/dataplane/virtio-blk.c b/hw/block/dataplane/virtio-blk.c
> index 36f3d2b..6db5c22 100644
> --- a/hw/block/dataplane/virtio-blk.c
> +++ b/hw/block/dataplane/virtio-blk.c
> @@ -49,6 +49,8 @@ struct VirtIOBlockDataPlane {
>
> /* Operation blocker on BDS */
> Error *blocker;
> +
> + QemuMutex start_lock;
> };
>
> /* Raise an interrupt to signal guest, if necessary */
> @@ -150,6 +152,7 @@ void virtio_blk_data_plane_create(VirtIODevice *vdev, VirtIOBlkConf *conf,
> s = g_new0(VirtIOBlockDataPlane, 1);
> s->vdev = vdev;
> s->conf = conf;
> + qemu_mutex_init(&s->start_lock);
>
> if (conf->iothread) {
> s->iothread = conf->iothread;
> @@ -184,15 +187,38 @@ void virtio_blk_data_plane_destroy(VirtIOBlockDataPlane *s)
> g_free(s);
> }
>
> +typedef struct {
> + VirtIOBlockDataPlane *s;
> + QEMUBH *bh;
> +} VirtIOBlockStartData;
> +
> +static void virtio_blk_data_plane_start_bh_cb(void *opaque)
> +{
> + VirtIOBlockStartData *data = opaque;
> + VirtIOBlockDataPlane *s = data->s;
> +
> + /* Kick right away to begin processing requests already in vring */
> + event_notifier_set(virtio_queue_get_host_notifier(s->vq));
> +
> + /* Get this show started by hooking up our callbacks */
> + virtio_queue_aio_set_host_notifier_handler(s->vq, s->ctx, true, true);
> +
> + qemu_bh_delete(data->bh);
> + g_free(data);
> +}
> +
> /* Context: QEMU global mutex held */
> void virtio_blk_data_plane_start(VirtIOBlockDataPlane *s)
> {
> BusState *qbus = BUS(qdev_get_parent_bus(DEVICE(s->vdev)));
> VirtioBusClass *k = VIRTIO_BUS_GET_CLASS(qbus);
> VirtIOBlock *vblk = VIRTIO_BLK(s->vdev);
> + VirtIOBlockStartData *data;
> int r;
>
> + qemu_mutex_lock(&s->start_lock);
> if (vblk->dataplane_started || s->starting) {
> + qemu_mutex_unlock(&s->start_lock);
> return;
> }
>
> @@ -221,13 +247,11 @@ void virtio_blk_data_plane_start(VirtIOBlockDataPlane *s)
>
> blk_set_aio_context(s->conf->conf.blk, s->ctx);
>
> - /* Kick right away to begin processing requests already in vring */
> - event_notifier_set(virtio_queue_get_host_notifier(s->vq));
> -
> - /* Get this show started by hooking up our callbacks */
> - aio_context_acquire(s->ctx);
> - virtio_queue_aio_set_host_notifier_handler(s->vq, s->ctx, true, true);
> - aio_context_release(s->ctx);
> + data = g_new(VirtIOBlockStartData, 1);
> + data->s = s;
> + data->bh = aio_bh_new(s->ctx, virtio_blk_data_plane_start_bh_cb, data);
> + qemu_bh_schedule(data->bh);
> + qemu_mutex_unlock(&s->start_lock);
> return;
>
> fail_host_notifier:
> @@ -236,6 +260,7 @@ void virtio_blk_data_plane_start(VirtIOBlockDataPlane *s)
> s->disabled = true;
> s->starting = false;
> vblk->dataplane_started = true;
> + qemu_mutex_unlock(&s->start_lock);
> }
>
> /* Context: QEMU global mutex held */
>
next prev parent reply other threads:[~2016-03-16 5:22 UTC|newest]
Thread overview: 48+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-02-14 17:17 [Qemu-devel] [PATCH 0/8] virtio: allow migration with dataplane Paolo Bonzini
2016-02-14 17:17 ` [Qemu-devel] [PATCH 1/8] block-migration: acquire AioContext as necessary Paolo Bonzini
2016-02-16 7:17 ` Fam Zheng
2016-02-19 7:41 ` Michael S. Tsirkin
2016-02-19 15:02 ` Paolo Bonzini
2016-02-14 17:17 ` [Qemu-devel] [PATCH 2/8] vring: make vring_enable_notification return void Paolo Bonzini
2016-02-15 16:24 ` Cornelia Huck
2016-02-16 7:18 ` Fam Zheng
2016-02-14 17:17 ` [Qemu-devel] [PATCH 3/8] virtio: add AioContext-specific function for host notifiers Paolo Bonzini
2016-02-16 7:20 ` Fam Zheng
2016-02-16 9:42 ` Cornelia Huck
2016-02-14 17:17 ` [Qemu-devel] [PATCH 4/8] virtio: export vring_notify as virtio_should_notify Paolo Bonzini
2016-02-15 16:44 ` Cornelia Huck
2016-02-16 7:21 ` Fam Zheng
2016-02-14 17:17 ` [Qemu-devel] [PATCH 5/8] virtio-blk: fix "disabled data plane" mode Paolo Bonzini
2016-02-15 17:58 ` Cornelia Huck
2016-02-16 15:45 ` Paolo Bonzini
2016-02-16 16:15 ` Cornelia Huck
2016-02-16 16:29 ` Paolo Bonzini
2016-02-16 7:26 ` Fam Zheng
2016-03-09 12:21 ` Christian Borntraeger
2016-03-09 12:55 ` Paolo Bonzini
2016-03-09 13:02 ` Christian Borntraeger
2016-03-09 13:05 ` Christian Borntraeger
2016-03-09 13:12 ` Cornelia Huck
2016-03-09 14:29 ` Christian Borntraeger
2016-03-09 16:17 ` Paolo Bonzini
2016-03-10 1:51 ` Fam Zheng
2016-03-10 9:03 ` Christian Borntraeger
2016-03-10 9:40 ` Christian Borntraeger
2016-03-11 10:28 ` Paolo Bonzini
2016-03-14 9:18 ` tu bo
2016-03-15 12:45 ` Fam Zheng
2016-03-15 13:18 ` Cornelia Huck
2016-03-15 14:08 ` Paolo Bonzini
2016-03-15 23:34 ` Fam Zheng
2016-03-16 5:21 ` tu bo [this message]
2016-02-14 17:17 ` [Qemu-devel] [PATCH 6/8] virtio-blk: do not use vring in dataplane Paolo Bonzini
2016-02-16 8:15 ` Fam Zheng
2016-02-14 17:17 ` [Qemu-devel] [PATCH 7/8] virtio-scsi: " Paolo Bonzini
2016-02-16 8:28 ` Fam Zheng
2016-02-14 17:17 ` [Qemu-devel] [PATCH 8/8] vring: remove Paolo Bonzini
2016-02-16 8:32 ` Fam Zheng
2016-02-16 8:57 ` [Qemu-devel] [PATCH 0/8] virtio: allow migration with dataplane Christian Borntraeger
2016-02-16 16:25 ` Paolo Bonzini
2016-02-19 7:42 ` Michael S. Tsirkin
2016-02-19 14:59 ` Paolo Bonzini
2016-02-22 16:01 ` Stefan Hajnoczi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=56E8ED72.4030004@linux.vnet.ibm.com \
--to=tubo@linux.vnet.ibm.com \
--cc=borntraeger@de.ibm.com \
--cc=famz@redhat.com \
--cc=pbonzini@redhat.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.