From: Jeremy Fitzhardinge <jeremy@goop.org>
To: "Christopher S. Aker" <caker@theshore.net>
Cc: xen devel <xen-devel@lists.xensource.com>
Subject: Re: High Net and Disk Use == stuck domain
Date: Mon, 01 Dec 2008 12:19:50 -0800
Message-ID: <493446E6.2060706@goop.org>
In-Reply-To: <4933FD44.7050101@theshore.net>
Christopher S. Aker wrote:
> Christopher S. Aker wrote:
>> For the past year or so we've been seeing a bug whereby a domU's CPU
>> would spin up to a steady 100, 200, 300 or 400% (4 vcpus), console
>> would freeze, and some or all of the network-facing services within
>> the domU would connect but block without any output. Disk IO would
>> flatline. The domU would never recover and required rebooting.
>>
>> Since pv_ops hasn't always been around, we previously had only seen
>> this behavior with xen-patched domUs (2.6.18.x), but now we're seeing
>> it with pv_ops. Identical symptoms. And, I have a user that is able
>> to reliably reproduce it on 2.6.27.4!
>>
>> His recipe is downloading an ISO from a very fast and close-by news
>> server using nzbget. The trigger appears to be a combination of high
>> network use and high disk use (like download from a very fast mirror)
>> -- because we weren't able to reproduce the problem when saving to a
>> tmpfs mount.
>>
>> I was able to grab the output of sysrq t while it was in the bad state:
>>
>> http://theshore.net/~caker/xen/BUGS/D-state/console.log
>>
>> The number of processes in D state (39) is quite suspicious.
>>
>> Let me know if there's anything else I can provide.
>>
>> -Chris
>
> Jeremy,
>
> Did this one slip by you? I figured a reproducible bug would be just
> too tantalizing to resist.

Hoping it would go away by itself? ;)

I'm trying to repro it now, copying ISOs at 25 Mbytes/sec. How long
does it take to happen?

> What's the correct venue for these issues that overlap xen-devel,
> lkml, and virtualization/pv_ops stuff -- should I be blasting these to
> everybody?

Me and xen-devel are a good start, and posting in a bugzilla cc:ing me
if it looks like it's been dropped on the floor.

    J