qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
* Re: [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external()
       [not found] <20170504102339.31971-1-stefanha@redhat.com>
@ 2017-05-04 10:36 ` Paolo Bonzini
  2017-05-04 10:54   ` Fam Zheng
  0 siblings, 1 reply; 3+ messages in thread
From: Paolo Bonzini @ 2017-05-04 10:36 UTC (permalink / raw)
  To: Stefan Hajnoczi, qemu-devel; +Cc: qemu-block, Fam Zheng, qemu-stable



On 04/05/2017 12:23, Stefan Hajnoczi wrote:
> The main loop uses aio_disable_external()/aio_enable_external() to
> temporarily disable processing of external AioContext clients like
> device emulation.
> 
> This allows monitor commands to quiesce I/O and prevent the guest from
> submitting new requests while a monitor command is in progress.
> 
> The aio_enable_external() API is currently broken when an IOThread is in
> aio_poll() waiting for fd activity when the main loop re-enables
> external clients.  Incrementing ctx->external_disable_cnt does not wake
> the IOThread from ppoll(2) so fd processing remains suspended and leads
> to unresponsive emulated devices.
> 
> This patch adds an aio_notify() call to aio_enable_external() so the
> IOThread is kicked out of ppoll(2) and will re-arm the file descriptors.
> 
> The bug can be reproduced as follows:
> 
>   $ qemu -M accel=kvm -m 1024 \
>          -object iothread,id=iothread0 \
>          -device virtio-scsi-pci,iothread=iothread0,id=virtio-scsi-pci0 \
>          -drive if=none,id=drive0,aio=native,cache=none,format=raw,file=test.img \
>          -device scsi-hd,id=scsi-hd0,drive=drive0 \
>          -qmp tcp::5555,server,nowait
> 
>   $ scripts/qmp/qmp-shell localhost:5555
>   (qemu) blockdev-snapshot-sync device=drive0 snapshot-file=sn1.qcow2
>          mode=absolute-paths format=qcow2
> 
> After blockdev-snapshot-sync completes the SCSI disk will be
> unresponsive.  This leads to request timeouts inside the guest.

I agree this is the minimal fix and is the right thing to do.  The
bdrv_drained_begin/end device callbacks would also make it possible to
remove disable/enable external altogether, but that's more invasive.

Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
Cc: qemu-stable@nongnu.org

> Reported-by: Qianqian Zhu <qizhu@redhat.com>
> Suggested-by: Fam Zheng <famz@redhat.com>
> Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
> ---
>  include/block/aio.h | 1 +
>  1 file changed, 1 insertion(+)
> 
> diff --git a/include/block/aio.h b/include/block/aio.h
> index 406e323..5294b04 100644
> --- a/include/block/aio.h
> +++ b/include/block/aio.h
> @@ -456,6 +456,7 @@ static inline void aio_enable_external(AioContext *ctx)
>  {
>      assert(ctx->external_disable_cnt > 0);
>      atomic_dec(&ctx->external_disable_cnt);
> +    aio_notify(ctx);
>  }
>  
>  /**
> 

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external()
  2017-05-04 10:36 ` [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external() Paolo Bonzini
@ 2017-05-04 10:54   ` Fam Zheng
  2017-05-05 10:03     ` Stefan Hajnoczi
  0 siblings, 1 reply; 3+ messages in thread
From: Fam Zheng @ 2017-05-04 10:54 UTC (permalink / raw)
  To: Paolo Bonzini; +Cc: Stefan Hajnoczi, qemu-devel, qemu-block, qemu-stable

On Thu, 05/04 12:36, Paolo Bonzini wrote:
> 
> 
> On 04/05/2017 12:23, Stefan Hajnoczi wrote:
> > The main loop uses aio_disable_external()/aio_enable_external() to
> > temporarily disable processing of external AioContext clients like
> > device emulation.
> > 
> > This allows monitor commands to quiesce I/O and prevent the guest from
> > submitting new requests while a monitor command is in progress.
> > 
> > The aio_enable_external() API is currently broken when an IOThread is in
> > aio_poll() waiting for fd activity when the main loop re-enables
> > external clients.  Incrementing ctx->external_disable_cnt does not wake
> > the IOThread from ppoll(2) so fd processing remains suspended and leads
> > to unresponsive emulated devices.
> > 
> > This patch adds an aio_notify() call to aio_enable_external() so the
> > IOThread is kicked out of ppoll(2) and will re-arm the file descriptors.
> > 
> > The bug can be reproduced as follows:
> > 
> >   $ qemu -M accel=kvm -m 1024 \
> >          -object iothread,id=iothread0 \
> >          -device virtio-scsi-pci,iothread=iothread0,id=virtio-scsi-pci0 \
> >          -drive if=none,id=drive0,aio=native,cache=none,format=raw,file=test.img \
> >          -device scsi-hd,id=scsi-hd0,drive=drive0 \
> >          -qmp tcp::5555,server,nowait
> > 
> >   $ scripts/qmp/qmp-shell localhost:5555
> >   (qemu) blockdev-snapshot-sync device=drive0 snapshot-file=sn1.qcow2
> >          mode=absolute-paths format=qcow2
> > 
> > After blockdev-snapshot-sync completes the SCSI disk will be
> > unresponsive.  This leads to request timeouts inside the guest.
> 
> I agree this is the minimal fix and is the right thing to do.  The
> bdrv_drained_begin/end device callbacks would also make it possible to
> remove disable/enable external altogether, but that's more invasive.
> 
> Reviewed-by: Paolo Bonzini <pbonzini@redhat.com>
> Cc: qemu-stable@nongnu.org
> 
> > Reported-by: Qianqian Zhu <qizhu@redhat.com>
> > Suggested-by: Fam Zheng <famz@redhat.com>
> > Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com>
> > ---
> >  include/block/aio.h | 1 +
> >  1 file changed, 1 insertion(+)
> > 
> > diff --git a/include/block/aio.h b/include/block/aio.h
> > index 406e323..5294b04 100644
> > --- a/include/block/aio.h
> > +++ b/include/block/aio.h
> > @@ -456,6 +456,7 @@ static inline void aio_enable_external(AioContext *ctx)
> >  {
> >      assert(ctx->external_disable_cnt > 0);
> >      atomic_dec(&ctx->external_disable_cnt);

This can be changed to atomic_fetch_dec and only call aio_notify if it returned
1, which is cleaner.

> > +    aio_notify(ctx);
> >  }
> >  
> >  /**
> > 

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external()
  2017-05-04 10:54   ` Fam Zheng
@ 2017-05-05 10:03     ` Stefan Hajnoczi
  0 siblings, 0 replies; 3+ messages in thread
From: Stefan Hajnoczi @ 2017-05-05 10:03 UTC (permalink / raw)
  To: Fam Zheng; +Cc: Paolo Bonzini, qemu-devel, qemu-block, qemu-stable

[-- Attachment #1: Type: text/plain, Size: 665 bytes --]

On Thu, May 04, 2017 at 06:54:58PM +0800, Fam Zheng wrote:
> On Thu, 05/04 12:36, Paolo Bonzini wrote:
> > On 04/05/2017 12:23, Stefan Hajnoczi wrote:
> > > diff --git a/include/block/aio.h b/include/block/aio.h
> > > index 406e323..5294b04 100644
> > > --- a/include/block/aio.h
> > > +++ b/include/block/aio.h
> > > @@ -456,6 +456,7 @@ static inline void aio_enable_external(AioContext *ctx)
> > >  {
> > >      assert(ctx->external_disable_cnt > 0);
> > >      atomic_dec(&ctx->external_disable_cnt);
> 
> This can be changed to atomic_fetch_dec and only call aio_notify if it returned
> 1, which is cleaner.

Nice idea.  Will send v2.

Stefan

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 455 bytes --]

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2017-05-05 10:03 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
     [not found] <20170504102339.31971-1-stefanha@redhat.com>
2017-05-04 10:36 ` [Qemu-devel] [PATCH] aio: add missing aio_notify() to aio_enable_external() Paolo Bonzini
2017-05-04 10:54   ` Fam Zheng
2017-05-05 10:03     ` Stefan Hajnoczi

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).