From: Olivier Hanesse <olivier.hanesse@gmail.com>
To: Dan Magenheimer <dan.magenheimer@oracle.com>
Cc: Jeremy Fitzhardinge <jeremy@goop.org>,
xen-devel@lists.xensource.com, Keir Fraser <keir@xen.org>,
Jan Beulich <JBeulich@novell.com>,
Keir Fraser <keir.xen@gmail.com>,
Xen Users <xen-users@lists.xensource.com>,
Mark Adams <mark@campbell-lange.net>
Subject: Re: Xen 4 TSC problems
Date: Mon, 28 Feb 2011 16:54:29 +0100 [thread overview]
Message-ID: <AANLkTinK94GDOF_GeaVs50wK0u8EdZ2rtzPuh65ca8qR@mail.gmail.com> (raw)
In-Reply-To: <b1ca6ddc-ca69-447e-93e9-8d36d1ff4a43@default>
[-- Attachment #1.1: Type: text/plain, Size: 6811 bytes --]
Yes this is what I mean.
I am glad to hear that it isn't a bad sign :)
I thought of a bad sign, because on system with "reliable TSC", this counter
is always 0.
2011/2/28 Dan Magenheimer <dan.magenheimer@oracle.com>
> Hi Olivier –
>
>
>
> By “warp=3000+ in debug message” do you mean the Xen boot message “TSC has
> constant rate..., warp = NNNN”?
>
>
>
> If so, this is a very different “warp” measured in cycles, not in seconds,
> so 3000 is more like a microsecond not an hour, ! and this is normal (not a
> bad sign).
>
>
>
> Dan
>
>
>
> *From:* Olivier Hanesse [mailto:olivier.hanesse@gmail.com]
> *Sent:* Monday, February 28, 2011 8:23 AM
> *To:* Dan Magenheimer
> *Cc:* Jeremy Fitzhardinge; Keir Fraser; Jan Beulich; Mark Adams;
> xen-devel@lists.xensource.com; Xen Users; Keir Fraser
>
> *Subject:* Re: [Xen-devel] Xen 4 TSC problems
>
>
>
> Keir :
>
>
>
> Yes, it is "under progress".
>
> To make this change, I had to reboot every server, so it is taking time
> (production server :()
>
> So i was hoping to find a quick method to mitigate this issue on domUs
> while rebooting servers.
>
>
>
> As this bug happens once or twice per server since October, I can't say
> that right now that changing platform timer to PIT fixed it. I have to wait
> (I hope forever!) this bug to happen again on a 'patched' server ...
>
>
>
> But even with clcoksource=pit, I am seeing some warp=3000+ in debug message
> ? I guess it is not a good sign, is it ?
>
>
>
> Jan : I was hoping to find a way to make the domU clocksource more
> "independent" like with xen3.2.
>
>
>
>
>
> 2011/2/28 Dan Magenheimer <dan.magenheimer@oracle.com>
>
> Hi Olivier –
>
>
>
> It is the Xen clocksource that you want to try to change, not the dom0
> clocksource. To do this, you need to specify “clocksource=pit” on the Xen
> boot line (and reboot), not the dom0 boot line.
>
>
>
> I believe Mark Adams played with tsc_mode to see if it solved! his
> (similar? identical?) problem last year, and it didn’t make any difference.
>
>
> Please try booting Xen with “clocksource=pit” and ensu! re that “Platform
> timer is 1.19MHz PIT” appears in the Xen boot messages. If the 50min jump
> does not appear again, it would point to a problem in the hpet, either
> hardware or software.
>
>
>
> Thanks,
>
> Dan
>
>
>
> *From:* Olivier Hanesse [mailto:olivier.hanesse@gmail.com]
> *Sent:* Monday, February 28, 2011 7:37 AM
> *To:* Jeremy Fitzhardinge
> *Cc:* Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams;
> xen-devel@lists.xensource.com; Xen Users; Keir Fraser
>
>
> *Subject:* Re: [Xen-devel] Xen 4 TSC problems
>
>
>
> Hello,
>
>
>
> It happened again twice this weekend.
>
>
>
> What about setting "tsc_mode=2" for my vms ? Should this mode prevent this
> bug (coming from a bad emulated tsc due to firmware issue ? is it possible
> ?) from affecting time in domUs ?
>
>
>
> Setting clocksource=pit, make 'tsc' available in
> "/sys/devices/system/clocksource/clocksource0/available_clocksource"
> (otherwise only xen is available, is it norma! l ? ).
>
>
>
> Should I bypass xen clocksource and use tsc as a clocksource for dom0/domU
> ? or will it be worsed ?
>
>
>
> Regards
>
>
>
> Olivier
>
>
>
> 2011/2/24 Jeremy Fitzhardinge <jeremy@goop.org>
>
> On 02/24/2011 09:43 AM, Dan Magenheimer wrote:
> > Just a wild guess, but this in Olivier's posted output:
> >
> > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more
> times.
> >
> > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the
> > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue
> > (or a complete red herring, but I thought it worth mentioning).
> >
> > Mark and Olivier, it would be interesting to know if you are
> > using the same processor/system.
>
> It definitely seems like some kind of problem on the host system rather
> than anything in the guests themselves. ! If the platform timer is
> misbehaving, then Xen could be completely screwing up the pvclock
> calibration which it then passes to guests.
>
> Could it be one of those "platform clock stops in certain power states"
> problems?
>
>
> J
>
> >> -----Original Message-----
> >> From: Keir Fraser [mailto:keir.xen@gmail.com]
> >> Sent: Thursday, February 24, 2011 7:52 AM
> >> To: Olivier Hanesse; Jan Beulich
> >> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen
> >> Users; Dan Magenheimer; Keir Fraser
> >> Subject: Re: [Xen-devel] Xen 4 TSC problems
> >>
> >> On 24/02/2011 14:20, "Olivier Hanesse" <olivier.hanesse@gmail.com>
> >> wrote:
> >>
> >>> Both dom0 and domUs are affected by this" jump".
> >>>
> >>> I expect to see something like "TSC marked as reliable, warp = 0".
> >>> I got this on newer hardware with same config/distros.
> >> It depends on the CPU itself, older CPUs do not have the super-stable
> >> TSC
> >> features. But that should never cause a massive 3000s time jump.
> >>
> >>> Is there a way to measure if it is a TSC warp ? to point out a cpu
> >> tsc issue ?
> >>
> >> The TSC warps or out-of-sync issues that we could reasonably expect
> >> would be
> >> on the order of microseconds. A 3000s warp is something else entirely.
> >> Xen
> >> is very confused and/or some TSC or platform timer has jumped a long
> >> way
> >> (indicating a hardware/firmware issue).
> >>
> >> -- Keir
> >>
>
> >>! ;> 2011/2/24 Jan Beulich <JBeulich@novell.com>
>
>
> >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse <olivier.hanesse@gmail.com>
> >> wrote:
> >>>>> I tried to turn off cstates with max_cstate=0 without success
> >> (still "not
> >>>>> reliable").
> >>>>>
> >>>>> With cpuidle=0, I also got :
> >>>>>
> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not
> >> reliable,
> >>>>> warp=3022 (count=1)
> >>>> This message by itself isn't telling much I believe.
> >>>>
> >>>>> xm info | grep command
> >>>>> xen_commandline : dom0_mem=512M cpuidle=0 loglvl=all
> >> guest_loglvl=all
>
> >>>>> dom0_max_vcpus=1 dom0_vcp! us_pin console=vga,com1 com1=19200,8n1
>
>
> >>>>>
> >>>>> Keir :
> >>>>>
> >>>>> Using clocksource=pit :
> >>>>>
> >>>>> (XEN) Platform timer is 1.193MHz PIT
> >>>>>
> >>>>> I also got :
> >>>>>
> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not
> >> reliable,
> >>>>> warp=3262 (count=2)
> >>>> The question is whether any of this eliminates the time jumps seen
> >>>> by your DomU-s (from your past mails I wasn't actually sure whether
> >>>> Dom0 also experienced this problem, albeit it would be odd if it
> >> didn't).
> >>>> Jan
> >>>>
> >>>> Jan
> >>>>
> >>>
> >>
>
>
>
>
>
[-- Attachment #1.2: Type: text/html, Size: 12227 bytes --]
[-- Attachment #2: Type: text/plain, Size: 138 bytes --]
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel
next prev parent reply other threads:[~2011-02-28 15:54 UTC|newest]
Thread overview: 81+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-02-23 10:49 Xen 4 TSC problems Olivier Hanesse
2011-02-23 16:16 ` [Xen-devel] " Dan Magenheimer
2011-02-23 17:19 ` Keir Fraser
2011-02-23 19:04 ` Olivier Hanesse
2011-02-24 7:16 ` Keir Fraser
2011-02-24 9:59 ` Olivier Hanesse
2011-02-24 10:59 ` Jan Beulich
2011-02-24 11:30 ` [Xen-devel] " Keir Fraser
2011-02-24 11:57 ` Olivier Hanesse
2011-02-24 12:37 ` Jan Beulich
2011-02-24 14:20 ` Olivier Hanesse
2011-02-24 14:52 ` [Xen-devel] " Keir Fraser
2011-02-24 17:43 ` Dan Magenheimer
2011-02-24 17:58 ` Olivier Hanesse
2011-02-24 19:01 ` [Xen-devel] " Jeremy Fitzhardinge
2011-02-28 14:37 ` Olivier Hanesse
2011-02-28 15:00 ` Keir Fraser
2011-02-28 15:14 ` [Xen-devel] " Dan Magenheimer
2011-02-28 15:23 ` Olivier Hanesse
2011-02-28 15:30 ` Dan Magenheimer
2011-02-28 15:54 ` Olivier Hanesse [this message]
2011-04-15 7:51 ` andre.arnold
2011-04-15 16:31 ` Olivier Hanesse
2012-09-27 15:54 ` [Xen-devel] " Mauro
2012-09-27 19:27 ` [Xen-users] " Dan Magenheimer
2012-09-27 21:28 ` Olivier Hanesse
2012-09-27 21:42 ` [Xen-devel] " Mauro
2012-09-29 8:08 ` [Xen-users] " Olivier Hanesse
2012-09-29 9:41 ` [Xen-devel] " Mauro
2012-09-29 12:19 ` Mauro
2012-09-30 15:13 ` [Xen-users] " Pasi Kärkkäinen
2012-09-30 19:23 ` [Xen-devel] " Mauro
2012-09-30 20:19 ` Mauro
2012-10-15 7:39 ` [Xen-users] " Olivier Hanesse
2012-10-15 8:05 ` Philippe.Simonet
2012-10-15 9:39 ` Mauro
2012-10-15 10:32 ` Jan Beulich
2012-10-15 11:24 ` [Xen-devel] " Mauro
2012-10-15 12:49 ` [Xen-users] " Jan Beulich
2012-10-15 14:25 ` Mauro
2012-10-17 16:15 ` Keir Fraser
2012-10-18 7:40 ` Ian Campbell
2012-10-18 7:55 ` Keir Fraser
2012-10-18 8:33 ` Ian Campbell
2012-10-18 8:56 ` [Xen-devel] " Mauro
2012-10-18 9:36 ` [Xen-users] " Ian Campbell
2012-10-18 13:45 ` Philippe.Simonet
2012-10-18 16:43 ` Keir Fraser
2012-10-21 20:52 ` [Xen-devel] " Mauro
2012-10-22 6:54 ` [Xen-users] " Jan Beulich
2012-10-22 9:17 ` [Xen-devel] " Mauro
2012-10-22 9:27 ` [Xen-users] " Jan Beulich
2012-10-22 10:40 ` [Xen-devel] " Mauro
2012-10-22 12:06 ` [Xen-users] " Jan Beulich
2012-10-23 7:19 ` Mauro
[not found] ` <CAE17a0VU3H8UmUWZv1+t98ME8LFgCZAWWBC1nhzM=ecbu6_GaQ@mail.gmail.com>
2012-10-23 7:58 ` Jan Beulich
2012-10-23 8:40 ` Mauro
2012-10-23 8:50 ` Jan Beulich
2012-10-23 11:50 ` Konrad Rzeszutek Wilk
2012-10-23 14:07 ` Mauro
2012-10-23 14:43 ` Jan Beulich
2012-10-23 14:46 ` Mauro
2012-10-23 15:34 ` Mauro
2012-10-23 15:49 ` Jan Beulich
2012-10-01 11:39 ` Zary Matej
2011-02-28 15:39 ` [Xen-devel] " Keir Fraser
2012-09-29 15:13 ` Mauro
2011-09-13 7:16 ` Philippe Simonet
2011-09-15 8:23 ` Konrad Rzeszutek Wilk
2011-09-15 8:24 ` Konrad Rzeszutek Wilk
2011-09-15 16:24 ` Jeremy Fitzhardinge
2011-09-16 6:03 ` Philippe.Simonet
2011-09-16 22:40 ` Jeremy Fitzhardinge
2011-09-19 5:45 ` Philippe Simonet
2011-09-15 10:36 ` George Dunlap
2011-09-15 18:38 ` Dan Magenheimer
2011-09-19 10:39 ` George Dunlap
2011-09-22 12:07 ` Jan Beulich
2011-09-30 6:33 ` Philippe.Simonet
2011-09-30 9:36 ` tommics
2011-09-30 17:16 ` Dan Magenheimer
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=AANLkTinK94GDOF_GeaVs50wK0u8EdZ2rtzPuh65ca8qR@mail.gmail.com \
--to=olivier.hanesse@gmail.com \
--cc=JBeulich@novell.com \
--cc=dan.magenheimer@oracle.com \
--cc=jeremy@goop.org \
--cc=keir.xen@gmail.com \
--cc=keir@xen.org \
--cc=mark@campbell-lange.net \
--cc=xen-devel@lists.xensource.com \
--cc=xen-users@lists.xensource.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).