All of lore.kernel.org
 help / color / mirror / Atom feed
From: Mike Dawson <mike.dawson@cloudapt.com>
To: Stefan Hajnoczi <stefanha@redhat.com>
Cc: Josh Durgin <josh.durgin@inktank.com>,
	ceph-users@lists.ceph.com,
	Oliver Francke <Oliver.Francke@filoo.de>,
	"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: [Qemu-devel] [ceph-users] qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_tasks_timout_secs message and unresponsive qemu-process, [Bug 1207686]
Date: Mon, 05 Aug 2013 16:08:47 -0400	[thread overview]
Message-ID: <5200064F.5090103@cloudapt.com> (raw)
In-Reply-To: <20130805074835.GA12658@stefanha-thinkpad.muc.redhat.com>

Josh,

Logs are uploaded to cephdrop with the file name 
mikedawson-rbd-qemu-deadlock.

- At about 2013-08-05 19:46 or 47, we hit the issue, traffic went to 0
- At about 2013-08-05 19:53:51, ran a 'virsh screenshot'


Environment is:

- Ceph 0.61.7 (client is co-mingled with three OSDs)
- rbd cache = true and cache=writeback
- qemu 1.4.0 1.4.0+dfsg-1expubuntu4
- Ubuntu Raring with 3.8.0-25-generic

This issue is reproducible in my environment, and I'm willing to run any 
wip branch you need. What else can I provide to help?

Thanks,
Mike Dawson


On 8/5/2013 3:48 AM, Stefan Hajnoczi wrote:
> On Sun, Aug 04, 2013 at 03:36:52PM +0200, Oliver Francke wrote:
>> Am 02.08.2013 um 23:47 schrieb Mike Dawson <mike.dawson@cloudapt.com>:
>>> We can "un-wedge" the guest by opening a NoVNC session or running a 'virsh screenshot' command. After that, the guest resumes and runs as expected. At that point we can examine the guest. Each time we'll see:
>
> If virsh screenshot works then this confirms that QEMU itself is still
> responding.  Its main loop cannot be blocked since it was able to
> process the screendump command.
>
> This supports Josh's theory that a callback is not being invoked.  The
> virtio-blk I/O request would be left in a pending state.
>
> Now here is where the behavior varies between configurations:
>
> On a Windows guest with 1 vCPU, you may see the symptom that the guest no
> longer responds to ping.
>
> On a Linux guest with multiple vCPUs, you may see the hung task message
> from the guest kernel because other vCPUs are still making progress.
> Just the vCPU that issued the I/O request and whose task is in
> UNINTERRUPTIBLE state would really be stuck.
>
> Basically, the symptoms depend not just on how QEMU is behaving but also
> on the guest kernel and how many vCPUs you have configured.
>
> I think this can explain how both problems you are observing, Oliver and
> Mike, are a result of the same bug.  At least I hope they are :).
>
> Stefan
>

  reply	other threads:[~2013-08-05 20:09 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <51FB887F.5070908@filoo.de>
     [not found] ` <51FC2903.3030802@cloudapt.com>
2013-08-04 13:36   ` [Qemu-devel] [ceph-users] qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_tasks_timout_secs message and unresponsive qemu-process, [Bug 1207686] Oliver Francke
2013-08-05  7:48     ` Stefan Hajnoczi
2013-08-05 20:08       ` Mike Dawson [this message]
2013-08-13 21:26         ` Sage Weil
2013-08-13 22:00           ` James Harper
2013-08-08 12:40       ` Oliver Francke
2013-08-08 17:01         ` Josh Durgin
2013-08-09  9:22           ` Oliver Francke
2013-08-09 14:05             ` Andrei Mikhailovsky
2013-08-09 15:03               ` Stefan Hajnoczi
2013-08-10  7:30                 ` Josh Durgin
2013-08-13 21:34             ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5200064F.5090103@cloudapt.com \
    --to=mike.dawson@cloudapt.com \
    --cc=Oliver.Francke@filoo.de \
    --cc=ceph-users@lists.ceph.com \
    --cc=josh.durgin@inktank.com \
    --cc=qemu-devel@nongnu.org \
    --cc=stefanha@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.