From: Dan Magenheimer <dan.magenheimer@oracle.com>
To: dan.magenheimer@oracle.com
Cc: Jeremy Fitzhardinge <jeremy@goop.org>, xen-devel@lists.xensource.com
Subject: RE: pvops domu soft lockup under load (more logs)
Date: Fri, 16 Apr 2010 09:00:00 -0700 (PDT)
Message-ID: <3f9fc65b-79b4-4736-aa11-00585306f3e0@default>
In-Reply-To: <f0c5ff4d-2be5-4915-97b6-13142c9e3a3c@default>

(Resending to list... Our corporate email gateway sadly has problems with
a "+" in an email address so I can't reply to Pim directly... if someone
could forward, I would appreciate it.)

Hi Pim --

I haven't read all of your previous postings, but with what you are
seeing, I'd be suspicious that your TSCs may be getting badly out
of sync, possibly due to power management.

Could you try booting Xen on the "bad" machines with the
Xen boot parameter max_cstate=0?

There may also be power management settings in the BIOS that
can be changed.
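
For comparing the "bad" and "good" hosts, a small sketch along these
lines could list the TSC-related flags in each cpuinfo dump. It is only
an illustration: the helper name and the sample text are made up here,
and the flag names (constant_tsc, nonstop_tsc) are the ones Linux
reports in /proc/cpuinfo on kernels that know about them; an older
2.6.18 dom0 may not show all of them.

```python
# Hypothetical helper: report which TSC-related flags a cpuinfo dump advertises.
TSC_FLAGS = ("tsc", "constant_tsc", "nonstop_tsc")

def tsc_flags(cpuinfo_text):
    """Return the TSC-related flags found on the first 'flags' line."""
    for line in cpuinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "flags":
            present = set(value.split())
            return [f for f in TSC_FLAGS if f in present]
    return []

# Made-up sample, not taken from the attached cpuinfo files.
sample = "processor\t: 0\nflags\t\t: fpu tsc msr constant_tsc lm\n"
print(tsc_flags(sample))
```

On a live machine the input would be the contents of /proc/cpuinfo.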

Hope that helps!
Dan

> > -----Original Message-----
> > From: Pim van Riezen [mailto:pi+lists@panelsix.com]
> > Sent: Friday, April 16, 2010 1:56 AM
> > To: Pim van Riezen
> > Cc: Jeremy Fitzhardinge; xen-devel@lists.xensource.com
> > Subject: Re: [Xen-devel] pvops domu soft lockup under load (more logs)
> >
> > Oh,
> >
> > On Apr 16, 2010, at 9:37 , Pim van Riezen wrote:
> >
> > > Another datapoint. This customer has similarly loaded VPS machines on
> > > a number of different hardware nodes. Not all of them had the lockup
> > > problem. I applied the jiffies clocksource to all his machines,
> > > regardless of their current problem status. After a day without
> > > lockups, the customer complained about time drift (ntp was not
> > > activated). The guests that had experienced the soft lockups earlier
> > > had major clock drift and were way ahead:
> > >
> > > 	16 Apr 09:29:26 ntpdate[11236]: step time server 194.109.22.18 offset -7337.731686 sec
> > >
> > > That's over 2 hours accumulated in less than 24 hours of uptime. The
> > > guests that hadn't been experiencing the lockup issues before
> > > switching to the jiffies clocksource hadn't drifted that much after
> > > the switch and were, at most, 120s behind after the same amount of
> > > runtime.
> >
> > There's more correlation between the guests that had the lockups and
> > those that didn't: the guests that locked up (and now have a way
> > speedy jiffies clock) were all on the same hardware platform, with an
> > older Xeon CPU than on the guests that had no issues. I attached
> > cpuinfo for both the broken and the non-broken dom0s. All are on
> > Xen-3.4.1 (hypervisor-version doesn't seem to affect this issue) and
> > the latest CentOS 2.6.18 dom0-kernel.
> >
> > Cheers,
> > Pim
> >
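
To put the quoted numbers in perspective, here is a rough calculation
of the drift rates. It assumes exactly 24 hours of uptime in both cases
(the report says "less than 24 hours", so the real rates would be
somewhat higher), and it uses 500 ppm as the usual upper bound on what
ntpd can correct by slewing. The function and constant names are
illustrative only.

```python
UPTIME_S = 24 * 3600          # assumed uptime; the report says "less than 24 hours"
NTPD_MAX_SLEW_PPM = 500       # usual limit on ntpd's frequency correction

def drift_ppm(offset_s, uptime_s=UPTIME_S):
    """Drift rate in parts per million for a given accumulated offset."""
    return abs(offset_s) / uptime_s * 1e6

fast_guest = drift_ppm(-7337.731686)   # ntpdate offset from the quoted log
slow_guest = drift_ppm(120)            # "at most, 120s behind"

print(f"lockup guests: {fast_guest:.0f} ppm ({fast_guest / 1e4:.1f}%)")
print(f"other guests:  {slow_guest:.0f} ppm ({slow_guest / 1e4:.2f}%)")
print("within ntpd slew range:",
      fast_guest <= NTPD_MAX_SLEW_PPM, slow_guest <= NTPD_MAX_SLEW_PPM)
```

Under these assumptions the guests that had locked up run roughly 8.5%
fast, against about 0.14% slow for the others, and both rates are
beyond what ntpd could absorb by slewing alone.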
