* Re: [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external()
[not found] <20170504102339.31971-1-stefanha@redhat.com>
@ 2017-05-04 10:36 ` Paolo Bonzini
2017-05-04 10:54 ` Fam Zheng
0 siblings, 1 reply; 3+ messages in thread
From: Paolo Bonzini @ 2017-05-04 10:36 UTC (permalink / raw)
To: Stefan Hajnoczi, qemu-devel; +Cc: qemu-block, Fam Zheng, qemu-stable
On 04/05/2017 12:23, Stefan Hajnoczi wrote:
> The main loop uses aio_disable_external()/aio_enable_external() to
> temporarily disable processing of external AioContext clients like
> device emulation.
>
> This allows monitor commands to quiesce I/O and prevent the guest from
> submitting new requests while a monitor command is in progress.
>
> The aio_enable_external() API is currently broken when an IOThread is in
> aio_poll() waiting for fd activity when the main loop re-enables
> external clients. Incrementing ctx->external_disable_cnt does not wake
> the IOThread from ppoll(2) so fd processing remains suspended and leads
> to unresponsive emulated devices.
>
> This patch adds an aio_notify() call to aio_enable_external() so the
> IOThread is kicked out of ppoll(2) and will re-arm the file descriptors.
>
> The bug can be reproduced as follows:
>
> $ qemu -M accel=kvm -m 1024 \
> -object iothread,id=iothread0 \
> -device virtio-scsi-pci,iothread=iothread0,id=virtio-scsi-pci0 \
> -drive if=none,id=drive0,aio=native,cache=none,format=raw,file=test.img \
> -device scsi-hd,id=scsi-hd0,drive=drive0 \
> -qmp tcp::5555,server,nowait
>
> $ scripts/qmp/qmp-shell localhost:5555
> (qemu) blockdev-snapshot-sync device=drive0 snapshot-file=sn1.qcow2
> mode=absolute-paths format=qcow2
>
> After blockdev-snapshot-sync completes the SCSI disk will be
> unresponsive. This leads to request timeouts inside the guest.
I agree this is the minimal fix and is the right thing to do. The
bdrv_drained_begin/end device callbacks would also make it possible to
remove disable/enable external altogether, but that's more invasive.
Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Cc: qemu-stable@nongnu.org
> Reported-by: Qianqian Zhu <qizhu@redhat.com>
> Suggested-by: Fam Zheng <famz@redhat.com>
> Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
> ---
> include/block/aio.h | 1 +
> 1 file changed, 1 insertion(+)
>
> diff --git a/include/block/aio.h b/include/block/aio.h
> index 406e323..5294b04 100644
> --- a/include/block/aio.h
> +++ b/include/block/aio.h
> @@ -456,6 +456,7 @@ static inline void aio_enable_external(AioContext *ctx)
> {
> assert(ctx->external_disable_cnt > 0);
> atomic_dec(&ctx->external_disable_cnt);
> + aio_notify(ctx);
> }
>
> /**
>
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external()
2017-05-04 10:36 ` [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external() Paolo Bonzini
@ 2017-05-04 10:54 ` Fam Zheng
2017-05-05 10:03 ` Stefan Hajnoczi
0 siblings, 1 reply; 3+ messages in thread
From: Fam Zheng @ 2017-05-04 10:54 UTC (permalink / raw)
To: Paolo Bonzini; +Cc: Stefan Hajnoczi, qemu-devel, qemu-block, qemu-stable
On Thu, 05/04 12:36, Paolo Bonzini wrote:
>
>
> On 04/05/2017 12:23, Stefan Hajnoczi wrote:
> > The main loop uses aio_disable_external()/aio_enable_external() to
> > temporarily disable processing of external AioContext clients like
> > device emulation.
> >
> > This allows monitor commands to quiesce I/O and prevent the guest from
> > submitting new requests while a monitor command is in progress.
> >
> > The aio_enable_external() API is currently broken when an IOThread is in
> > aio_poll() waiting for fd activity when the main loop re-enables
> > external clients. Incrementing ctx->external_disable_cnt does not wake
> > the IOThread from ppoll(2) so fd processing remains suspended and leads
> > to unresponsive emulated devices.
> >
> > This patch adds an aio_notify() call to aio_enable_external() so the
> > IOThread is kicked out of ppoll(2) and will re-arm the file descriptors.
> >
> > The bug can be reproduced as follows:
> >
> > $ qemu -M accel=kvm -m 1024 \
> > -object iothread,id=iothread0 \
> > -device virtio-scsi-pci,iothread=iothread0,id=virtio-scsi-pci0 \
> > -drive if=none,id=drive0,aio=native,cache=none,format=raw,file=test.img \
> > -device scsi-hd,id=scsi-hd0,drive=drive0 \
> > -qmp tcp::5555,server,nowait
> >
> > $ scripts/qmp/qmp-shell localhost:5555
> > (qemu) blockdev-snapshot-sync device=drive0 snapshot-file=sn1.qcow2
> > mode=absolute-paths format=qcow2
> >
> > After blockdev-snapshot-sync completes the SCSI disk will be
> > unresponsive. This leads to request timeouts inside the guest.
>
> I agree this is the minimal fix and is the right thing to do. The
> bdrv_drained_begin/end device callbacks would also make it possible to
> remove disable/enable external altogether, but that's more invasive.
>
> Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
> Cc: qemu-stable@nongnu.org
>
> > Reported-by: Qianqian Zhu <qizhu@redhat.com>
> > Suggested-by: Fam Zheng <famz@redhat.com>
> > Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
> > ---
> > include/block/aio.h | 1 +
> > 1 file changed, 1 insertion(+)
> >
> > diff --git a/include/block/aio.h b/include/block/aio.h
> > index 406e323..5294b04 100644
> > --- a/include/block/aio.h
> > +++ b/include/block/aio.h
> > @@ -456,6 +456,7 @@ static inline void aio_enable_external(AioContext *ctx)
> > {
> > assert(ctx->external_disable_cnt > 0);
> > atomic_dec(&ctx->external_disable_cnt);
This can be changed to atomic_fetch_dec and only call aio_notify if it returned
1, which is cleaner.
> > + aio_notify(ctx);
> > }
> >
> > /**
> >
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external()
2017-05-04 10:54 ` Fam Zheng
@ 2017-05-05 10:03 ` Stefan Hajnoczi
0 siblings, 0 replies; 3+ messages in thread
From: Stefan Hajnoczi @ 2017-05-05 10:03 UTC (permalink / raw)
To: Fam Zheng; +Cc: Paolo Bonzini, qemu-devel, qemu-block, qemu-stable
[-- Attachment #1: Type: text/plain, Size: 665 bytes --]
On Thu, May 04, 2017 at 06:54:58PM +0800, Fam Zheng wrote:
> On Thu, 05/04 12:36, Paolo Bonzini wrote:
> > On 04/05/2017 12:23, Stefan Hajnoczi wrote:
> > > diff --git a/include/block/aio.h b/include/block/aio.h
> > > index 406e323..5294b04 100644
> > > --- a/include/block/aio.h
> > > +++ b/include/block/aio.h
> > > @@ -456,6 +456,7 @@ static inline void aio_enable_external(AioContext *ctx)
> > > {
> > > assert(ctx->external_disable_cnt > 0);
> > > atomic_dec(&ctx->external_disable_cnt);
>
> This can be changed to atomic_fetch_dec and only call aio_notify if it returned
> 1, which is cleaner.
Nice idea. Will send v2.
Stefan
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 455 bytes --]
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2017-05-05 10:03 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <20170504102339.31971-1-stefanha@redhat.com>
2017-05-04 10:36 ` [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external() Paolo Bonzini
2017-05-04 10:54 ` Fam Zheng
2017-05-05 10:03 ` Stefan Hajnoczi
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).