All of lore.kernel.org
 help / color / mirror / Atom feed
From: Josh Durgin <josh.durgin@inktank.com>
To: Oliver Francke <Oliver.Francke@filoo.de>
Cc: ceph-users@lists.ceph.com, Mike Dawson <mike.dawson@cloudapt.com>,
	Stefan Hajnoczi <stefanha@redhat.com>,
	"qemu-devel@nongnu.org" <qemu-devel@nongnu.org>
Subject: Re: [Qemu-devel] [ceph-users] qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_tasks_timout_secs message and unresponsive qemu-process, [Bug 1207686]
Date: Thu, 08 Aug 2013 10:01:24 -0700	[thread overview]
Message-ID: <5203CEE4.7040901@inktank.com> (raw)
In-Reply-To: <520391D1.7070704@filoo.de>

On 08/08/2013 05:40 AM, Oliver Francke wrote:
> Hi Josh,
>
> I have a session logged with:
>
>      debug_ms=1:debug_rbd=20:debug_objectcacher=30
>
> as you requested from Mike, even if I think, we do have another story
> here, anyway.
>
> Host-kernel is: 3.10.0-rc7, qemu-client 1.6.0-rc2, client-kernel is
> 3.2.0-51-amd...
>
> Do you want me to open a ticket for that stuff? I have about 5MB
> compressed logfile waiting for you ;)

Yes, that'd be great. If you could include the time when you saw the 
guest hang that'd be ideal. I'm not sure if this is one or two bugs,
but it seems likely it's a bug in rbd and not qemu.

Thanks!
Josh

> Thnx in advance,
>
> Oliver.
>
> On 08/05/2013 09:48 AM, Stefan Hajnoczi wrote:
>> On Sun, Aug 04, 2013 at 03:36:52PM +0200, Oliver Francke wrote:
>>> Am 02.08.2013 um 23:47 schrieb Mike Dawson <mike.dawson@cloudapt.com>:
>>>> We can "un-wedge" the guest by opening a NoVNC session or running a
>>>> 'virsh screenshot' command. After that, the guest resumes and runs
>>>> as expected. At that point we can examine the guest. Each time we'll
>>>> see:
>> If virsh screenshot works then this confirms that QEMU itself is still
>> responding.  Its main loop cannot be blocked since it was able to
>> process the screendump command.
>>
>> This supports Josh's theory that a callback is not being invoked.  The
>> virtio-blk I/O request would be left in a pending state.
>>
>> Now here is where the behavior varies between configurations:
>>
>> On a Windows guest with 1 vCPU, you may see the symptom that the guest no
>> longer responds to ping.
>>
>> On a Linux guest with multiple vCPUs, you may see the hung task message
>> from the guest kernel because other vCPUs are still making progress.
>> Just the vCPU that issued the I/O request and whose task is in
>> UNINTERRUPTIBLE state would really be stuck.
>>
>> Basically, the symptoms depend not just on how QEMU is behaving but also
>> on the guest kernel and how many vCPUs you have configured.
>>
>> I think this can explain how both problems you are observing, Oliver and
>> Mike, are a result of the same bug.  At least I hope they are :).
>>
>> Stefan
>
>

  reply	other threads:[~2013-08-08 17:01 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <51FB887F.5070908@filoo.de>
     [not found] ` <51FC2903.3030802@cloudapt.com>
2013-08-04 13:36   ` [Qemu-devel] [ceph-users] qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_tasks_timout_secs message and unresponsive qemu-process, [Bug 1207686] Oliver Francke
2013-08-05  7:48     ` Stefan Hajnoczi
2013-08-05 20:08       ` Mike Dawson
2013-08-13 21:26         ` Sage Weil
2013-08-13 22:00           ` James Harper
2013-08-08 12:40       ` Oliver Francke
2013-08-08 17:01         ` Josh Durgin [this message]
2013-08-09  9:22           ` Oliver Francke
2013-08-09 14:05             ` Andrei Mikhailovsky
2013-08-09 15:03               ` Stefan Hajnoczi
2013-08-10  7:30                 ` Josh Durgin
2013-08-13 21:34             ` Sage Weil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=5203CEE4.7040901@inktank.com \
    --to=josh.durgin@inktank.com \
    --cc=Oliver.Francke@filoo.de \
    --cc=ceph-users@lists.ceph.com \
    --cc=mike.dawson@cloudapt.com \
    --cc=qemu-devel@nongnu.org \
    --cc=stefanha@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.