All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Christopher S. Aker" <caker@theshore.net>
To: Jeremy Fitzhardinge <jeremy@goop.org>
Cc: xen devel <xen-devel@lists.xensource.com>
Subject: Re: High Net and Disk Use == stuck domain
Date: Mon, 01 Dec 2008 10:05:40 -0500	[thread overview]
Message-ID: <4933FD44.7050101@theshore.net> (raw)
In-Reply-To: <4926E7DD.8040603@theshore.net>

Christopher S. Aker wrote:
> For the past year or so we've been seeing a bug whereby a domU's CPU 
> would spin up to a steady 100, 200, 300 or 400% (4 vcpus), console would 
> freeze, and some or all of the network-facing services within the domU 
> would connect but block without any output.  Disk IO would flatline. The 
> domU would never recover and required rebooting.
> 
> Since pv_ops hasn't always been around, we previously had only seen this 
> behavior with xen-patched domUs (2.6.18.x), but now we're seeing it with 
> pv_ops.  Identical symptoms.  And, I have a user that is able to 
> reliable reproduce it on 2.6.27.4!
> 
> His recipe is downloading an ISO from a very fast and close-by news 
> server using nzbget.  The trigger appears to be a combination of high 
> network use and high disk use (like download from a very fast mirror) -- 
> because we weren't able to reproduce the problem when saving to a tmpfs 
> mount.
> 
> I was able to grab the output of sysrq t while it was in the bad state:
> 
> http://theshore.net/~caker/xen/BUGS/D-state/console.log
> 
> The number of processes in D state (39) is quite suspicious.
> 
> Let me know if there's anything else I can provide.
> 
> -Chris

Jeremy,

Did this one slip by you?  I figured a reproducible bug would be just 
too tantalizing to resist.

What's the correct venue for these issues that overlap xen-devel, lkml, 
and virtualization/pv_ops stuff -- should I be blasting these to everybody?

-Chris

  parent reply	other threads:[~2008-12-01 15:05 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2008-11-21 16:54 High Net and Disk Use == stuck domain Christopher S. Aker
2008-11-21 17:07 ` Stefan de Konink
2008-11-21 17:16   ` Christopher S. Aker
2008-12-01 15:05 ` Christopher S. Aker [this message]
2008-12-01 20:19   ` Jeremy Fitzhardinge
2008-12-01 21:00     ` Christopher S. Aker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4933FD44.7050101@theshore.net \
    --to=caker@theshore.net \
    --cc=jeremy@goop.org \
    --cc=xen-devel@lists.xensource.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.