qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Fam Zheng <famz@redhat.com>
To: Zhang Haoyu <zhanghy@sangfor.com>
Cc: Stefan Hajnoczi <stefanha@gmail.com>,
	qemu-devel <qemu-devel@nongnu.org>, Bin Wu <wu.wubin@huawei.com>
Subject: Re: [Qemu-devel] the whole virtual machine hangs when IO does notcome back!
Date: Tue, 12 Aug 2014 10:27:12 +0800	[thread overview]
Message-ID: <20140812022712.GA23160@T430.nay.redhat.com> (raw)
In-Reply-To: <201408121009071107044@sangfor.com>

On Tue, 08/12 10:09, Zhang Haoyu wrote:
> >> > Hi,
> >> > 
> >> > I tested the reliability of qemu in the IPSAN environment as follows:
> >> > (1) create one VM on a X86 server which is connected to an IPSAN, and the VM
> >> > has only one system volume which is on the IPSAN;
> >> > (2) disconnect the network between the server and the IPSAN. On the server,
> >> > I have a "multipath" software which can hold the IO for a long time
> >> > (configurable) when the network is disconnected;
> >> > (3) about 30 seconds later, the whole VM hangs there, nothing can be done to
> >> > the VM!
> >> > 
> >> > Then, I used "gstack" tool to collect the stacks of all qemu threads, it
> >> > looked like:
> >> > 
> >> > Thread 8 (Thread 0x7fd840bb5700 (LWP 6671)):
> >> > #0  0x00007fd84253a4f6 in poll () from /lib64/libc.so.6
> >> > #1  0x00007fd84410ceff in aio_poll ()
> >> > #2  0x00007fd84429bb05 in qemu_aio_wait ()
> >> > #3  0x00007fd844120f51 in bdrv_drain_all ()
> >> > #4  0x00007fd8441f1a4a in bmdma_cmd_writeb ()
> >> > #5  0x00007fd8441f216e in bmdma_write ()
> >> > #6  0x00007fd8443a93cf in memory_region_write_accessor ()
> >> > #7  0x00007fd8443a94a6 in access_with_adjusted_size ()
> >> > #8  0x00007fd8443a9901 in memory_region_iorange_write ()
> >> > #9  0x00007fd8443a19bd in ioport_writeb_thunk ()
> >> > #10 0x00007fd8443a13a8 in ioport_write ()
> >> > #11 0x00007fd8443a1f55 in cpu_outb ()
> >> > #12 0x00007fd8443a5b12 in kvm_handle_io ()
> >> > #13 0x00007fd8443a64a9 in kvm_cpu_exec ()
> >> > #14 0x00007fd844330962 in qemu_kvm_cpu_thread_fn ()
> >> > #15 0x00007fd8427e77b6 in start_thread () from /lib64/libpthread.so.0
> >> > #16 0x00007fd8425439cd in clone () from /lib64/libc.so.6
> >> > #17 0x0000000000000000 in ?? ()
> >> 
> >> Use virtio-blk.  Read, write, and flush are asynchronous in virtio-blk.
> >> 
> >> Note that the QEMU monitor commands are typically synchronous so they
> >> will still block the VM.
> >> 
> >
> >If some of the requests are dropped by host and never return to QEMU, I think
> >bdrv_drain_all() will still cause the hang. Even with virtio-blk, reset has
> >such a call. Maybe we could add some -ETIMEDOUT machanism in QEMU's block
> >layer.
> >
> >A workaround might be to configure the host storage to fail the IO after a
> >timeout.
> >
> If -ETIMEOUT returned after a short time network disconnection, may unpredicted fault happened in VM ?
> e.g., the VM was reading important data(like, system data).
> Does aio replay work for this case?

Guest should do error handling with it, in a way similar to -EIO. The
connection is still down even if guest is free to retry, isn't it?

Fam

  reply	other threads:[~2014-08-12  2:27 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-08-11  8:33 [Qemu-devel] the whole virtual machine hangs when IO does not come back! Bin Wu
2014-08-11 14:21 ` Stefan Hajnoczi
2014-08-12  0:58   ` Fam Zheng
2014-08-12  2:09     ` [Qemu-devel] the whole virtual machine hangs when IO does notcome back! Zhang Haoyu
2014-08-12  2:27       ` Fam Zheng [this message]
2014-08-12  1:10   ` [Qemu-devel] the whole virtual machine hangs when IO does not come back! Bin Wu
2014-09-08  8:35     ` Stefan Hajnoczi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20140812022712.GA23160@T430.nay.redhat.com \
    --to=famz@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=stefanha@gmail.com \
    --cc=wu.wubin@huawei.com \
    --cc=zhanghy@sangfor.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).