From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Xen 4 TSC problems Date: Wed, 23 Feb 2011 11:49:49 +0100 Message-ID: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0607008851==" Return-path: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: xen-devel@lists.xensource.com, Xen Users List-Id: xen-devel@lists.xenproject.org --===============0607008851== Content-Type: multipart/alternative; boundary=0016367f985e237c1f049cf0dbcf --0016367f985e237c1f049cf0dbcf Content-Type: text/plain; charset=ISO-8859-1 Hello I've got an issue about time keeping with Xen 4.0 (Debian squeeze release). My problem is here (hopefully I amn't the only one, so there might be a bug somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=599161#50 After some times, I got this error : Clocksource tsc unstable (delta = -2999660334211 ns). It has happened on several servers. Looking at the output of "xm debug-key s;" (XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp=2850 (count=3) I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the "constant_tsc", but not the "nonstop_tsc" one. On other systems with a newer cpu with "nonstop_tsc", I don't have this issue (systems are running the same distros with same config). I tried to boot with "max_cstate=0", but nothing changed, my TSC isn't reliable and after some times, I will got the "50min" issue again. I don't understand how a system can do a jump of "50min" in the future. Why 50min ? it is not 40min, not 1 hour, it is always 50min. I don't know how to make my TSC "reliable" (I already disable everything about Powerstate in BIOS Settings). Any ideas ? Regards Olivier --0016367f985e237c1f049cf0dbcf Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hello

I've got an issue about time keepin= g with Xen 4.0 (Debian squeeze release).=A0

My pro= blem is here (hopefully I amn't the only one, so there might be a bug s= omewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161#50<= /div>
After some times, =A0I got this error : Clocksource tsc unstable (delt= a =3D -2999660334211 ns). It has happened on several servers.=A0
=
Looking at the output of "xm debug-key s;"

(XEN) TSC has constant rate, deep Cstates possible, so not r= eliable, warp=3D2850 (count=3D3)

I am using a &quo= t;Intel(R) Xeon(R) CPU L5420 =A0@ 2.50GHz", which has the "consta= nt_tsc", but not the "nonstop_tsc" one.
On other systems with a newer cpu with "nonstop_tsc", I don&= #39;t have this issue (systems are running the same distros with same confi= g).

I tried to boot with "max_cstate=3D0"= ;, but nothing changed, my TSC isn't reliable and after some times, I w= ill got the "50min" issue again.

I don't understand how a system can do a jump of &q= uot;50min" in the future. Why 50min ? it is not 40min, not 1 hour, it = is always 50min.
I don't know how to make my TSC "reliab= le" (I already disable everything about Powerstate in BIOS Settings).<= /div>

Any ideas ?

Regards
=
Olivier
--0016367f985e237c1f049cf0dbcf-- --===============0607008851== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel --===============0607008851==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: RE: [Xen-devel] Xen 4 TSC problems Date: Wed, 23 Feb 2011 08:16:20 -0800 (PST) Message-ID: <4e4ddd9f-ed80-48b7-b001-c6b02c0d1935@default> References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0989069949==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Olivier Hanesse , xen-devel@lists.xensource.com, Xen Users Cc: Jeremy Fitzhardinge , keir@xen.org, Mark Adams List-Id: xen-devel@lists.xenproject.org --===============0989069949== Content-Type: multipart/alternative; boundary="__129847779289796705abhmt006" --__129847779289796705abhmt006 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable It's very unlikely this is a problem with TSC. It is most likely a Xen (or = possibly a PV Linux) problem where a guest (or dom0) either "goes out to lu= nch" for a long period, or some other timer gets stuck. The "clocksource t= sc unstable" message is a side effect of this... it's very likely the TSC t= hat IS stable and correct and the other clocksource (pvclock) has lost/gain= ed 50 minutes! =20 Mark Adams cc'ed and his original xen-devel posting below. The fact that t= wo different users (possibly on the same processor/system type?) have submi= tted the message with a delta so similar would lead me to believe there is = some timer that is "wrapping". And since pvclock is usually the clocksourc= e for dom0, and pvclock is driven by Xen's "system time", a reasonable gues= s is that the timer that is wrapping is in Xen itself. =20 Mark's delta =3D -2999660303788 ns Your delta =3D -2999660334211 ns =20 Googling, I see the HPET wraparound is ~306 seconds and this delta is about= 3000 seconds, so that may be a bad guess. =20 Keir, any thoughts on this? Do you recall any post-4.0 patches that may ha= ve fixed this? =20 Thanks, Dan =20 References: http://lists.xensource.com/archives/html/xen-devel/2010-10/msg00210.html https://lkml.org/lkml/2010/10/26/126 =20 From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com]=20 Sent: Wednesday, February 23, 2011 3:50 AM To: xen-devel@lists.xensource.com; Xen Users Subject: [Xen-devel] Xen 4 TSC problems =20 Hello =20 I've got an issue about time keeping with Xen 4.0 (Debian squeeze release).= =20 =20 My problem is here (hopefully I amn't the only one, so there might be a bug= somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161#50 After some times, I got this error : Clocksource tsc unstable (delta =3D -= 2999660334211 ns). It has happened on several servers.=20 =20 Looking at the output of "xm debug-key s;" =20 (XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp= =3D2850 (count=3D3) =20 I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the "consta= nt_tsc", but not the "nonstop_tsc" one. On other systems with a newer cpu with "nonstop_tsc", I don't have this iss= ue (systems are running the same distros with same config). =20 I tried to boot with "max_cstate=3D0", but nothing changed, my TSC isn't re= liable and after some times, I will got the "50min" issue again. =20 I don't understand how a system can do a jump of "50min" in the future. Why= 50min ? it is not 40min, not 1 hour, it is always 50min. I don't know how to make my TSC "reliable" (I already disable everything ab= out Powerstate in BIOS Settings). =20 Any ideas ? =20 Regards =20 Olivier --__129847779289796705abhmt006 Content-Type: text/html; charset=us-ascii Content-Transfer-Encoding: quoted-printable

It’s v= ery unlikely this is a problem with TSC. It is most likely a Xen (or poss= ibly a PV Linux) problem where a guest (or dom0) either “goes out t= o lunch” for a long period, or some other timer gets stuck.  T= he “clocksource tsc unstable” message is a side effect of thi= s... it’s very likely the TSC that IS stable and correct and the ot= her clocksource (pvclock) has lost/gained 50 minutes!

 

Mark Adams cc&#= 8217;ed and his original xen-devel posting below.  The fact that two= different users (possibly on the same processor/system type?) have submi= tted the message with a delta so similar would lead me to believe there i= s some timer that is “wrapping”.  And since pvclock is u= sually the clocksource for dom0, and pvclock is driven! =20 by Xen’s “system time”, a reasonable guess is that the tim= er that is wrapping is in Xen itself.

=  

Mark’s delta =3D -2999660= 303788 ns

Your delta =3D -299966033421= 1 ns

 

Googling, I see the HPET wraparound is ~306 seconds and this del= ta is about 3000 seconds, so that may be a bad guess.

 

Keir, any thoug= hts on this?  Do you recall any post-4.0 patches that may have fixed= this?

 

Thanks,

Dan

 

References:<= o:p>

http://lists.xensource.com/archives/html/xen-devel/2010-10/msg00= 210.html

https://lkml.org/lkml/2010/10/26/126=

 

From: Olivier Hane= sse [mailto:olivier.hanesse@gmail.com]
Sent: Wednesday, Februa= ry 23, 2011 3:50 AM
To: xen-devel@lists.xensource.co! =20 m; Xen Users
Subject: [Xen-devel] Xen 4 TSC problems
=

 

Hello

 =

I've got an issue about time ke= eping with Xen 4.0 (Debian squeeze release). 

 

My problem is here (hopefully I amn't the only one, so there might be= a bug somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D59= 9161#50

After some time= s,  I got this error : Clocksource tsc unstable (delta =3D -29996603= 34211 ns). It has happened on several servers. 

=

 

Looking at the output of "xm debug-key s;"

 <= /p>

(XEN) TSC has constant rate, deep Csta= tes possible, so not reliable, warp=3D2850 (count=3D3)

 

I am using a "Intel(R) Xeon(R) CPU L5420  @ 2.50GHz&quo= t;, which has the "constant_tsc", but not the "nonstop_tsc= " one.

On other system= s with a newer cpu with "nonstop_tsc", I don't have this issue = (systems are running the same distros with same config).

 

I tried to boot with "max_cstate=3D0", but nothing ch= anged, my TSC isn't reliable and after some times, I will got the "5= 0min" issue again.

 

I don't unders! =20 tand how a system can do a jump of "50min" in the future. Why 50min ? it= is not 40min, not 1 hour, it is always 50min.

<= p class=3DMsoNormal>I don't know how to make my TSC "reliable" = (I already disable everything about Powerstate in BIOS Settings).

 

Any ideas ?

 

Regards

 

Olivier

--__129847779289796705abhmt006-- --===============0989069949== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users --===============0989069949==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: Xen 4 TSC problems Date: Wed, 23 Feb 2011 17:19:13 +0000 Message-ID: References: <4e4ddd9f-ed80-48b7-b001-c6b02c0d1935@default> Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <4e4ddd9f-ed80-48b7-b001-c6b02c0d1935@default> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Dan Magenheimer , Olivier Hanesse , xen-devel@lists.xensource.com, Xen Users Cc: Jeremy Fitzhardinge , Mark Adams List-Id: xen-devel@lists.xenproject.org On 23/02/2011 16:16, "Dan Magenheimer" wrote: > It=B9s very unlikely this is a problem with TSC. It is most likely a Xen (o= r > possibly a PV Linux) problem where a guest (or dom0) either =B3goes out to > lunch=B2 for a long period, or some other timer gets stuck. The =B3clocksour= ce > tsc unstable=B2 message is a side effect of this... it=B9s very likely the TS= C > that IS stable and correct and the other clocksource (pvclock) has lost/g= ained > 50 minutes! > =20 > Mark Adams cc=B9ed and his original xen-devel posting below. The fact that= two > different users (possibly on the same processor/system type?) have submit= ted > the message with a delta so similar would lead me to believe there is som= e > timer that is =B3wrapping=B2. And since pvclock is usually the clocksource f= or > dom0, and pvclock is driven! by Xen=B9s =B3system time=B2, a reasonable guess = is > that the timer that is wrapping is in Xen itself. > =20 > Mark=B9s delta =3D -2999660303788 ns > Your delta =3D -2999660334211 ns > =20 > Googling, I see the HPET wraparound is ~306 seconds and this delta is abo= ut > 3000 seconds, so that may be a bad guess. > =20 > Keir, any thoughts on this? Do you recall any post-4.0 patches that may = have > fixed this? I've never seen a 3000s wrap, and I don't know of anything that would have fixed a bug like this. If this is a Xen time wrap of some kind then it woul= d affect all running guests; it's not clear here whether only one, or all, guests see the wrap. K. > Thanks, > Dan > =20 > References: > http://lists.xensource.com/archives/html/xen-devel/2010-10/msg00210.html > https://lkml.org/lkml/2010/10/26/126 > =20 >=20 > From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com] > Sent: Wednesday, February 23, 2011 3:50 AM > To: xen-devel@lists.xensource.co! m; Xen Users > Subject: [Xen-devel] Xen 4 TSC problems > =20 >=20 > Hello >=20 > =20 >=20 > I've got an issue about time keeping with Xen 4.0 (Debian squeeze release= ). >=20 > =20 >=20 > My problem is here (hopefully I amn't the only one, so there might be a b= ug > somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161#50 >=20 > After some times, I got this error : Clocksource tsc unstable (delta =3D > -2999660334211 ns). It has happened on several servers. >=20 > =20 >=20 > Looking at the output of "xm debug-key s;" >=20 > =20 >=20 > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp= =3D2850 > (count=3D3) >=20 > =20 >=20 > I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the > "constant_tsc", but not the "nonstop_tsc" one. >=20 > On other systems with a newer cpu with "nonstop_tsc", I don't have this i= ssue > (systems are running the same distros with same config). >=20 > =20 >=20 > I tried to boot with "max_cstate=3D0", but nothing changed, my TSC isn't > reliable and after some times, I will got the "50min" issue again. >=20 > =20 >=20 > I don't unders! tand how a system can do a jump of "50min" in the future= . Why > 50min ? it is not 40min, not 1 hour, it is always 50min. >=20 > I don't know how to make my TSC "reliable" (I already disable everything = about > Powerstate in BIOS Settings). >=20 > =20 >=20 > Any ideas ? >=20 > =20 >=20 > Regards >=20 > =20 >=20 > Olivier >=20 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: Xen 4 TSC problems Date: Wed, 23 Feb 2011 20:04:25 +0100 Message-ID: <4D655A39.2040501@gmail.com> References: Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Keir Fraser Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Xen Users , Jeremy Fitzhardinge , Mark Adams List-Id: xen-devel@lists.xenproject.org I am sorry for the lack of information. Every domUs on the dom0 are affected by this bug at the exact same time. And I had this bug on a dozen servers (all running on the same hw) since=20 October (when I switched from Xen 3.2 to 4.0). Regards Olivier Le 23/02/2011 18:19, Keir Fraser a =E9crit : > On 23/02/2011 16:16, "Dan Magenheimer" wro= te: > >> It=B9s very unlikely this is a problem with TSC. It is most likely a X= en (or >> possibly a PV Linux) problem where a guest (or dom0) either =B3goes ou= t to >> lunch=B2 for a long period, or some other timer gets stuck. The =B3cl= ocksource >> tsc unstable=B2 message is a side effect of this... it=B9s very likely= the TSC >> that IS stable and correct and the other clocksource (pvclock) has los= t/gained >> 50 minutes! >> >> Mark Adams cc=B9ed and his original xen-devel posting below. The fact= that two >> different users (possibly on the same processor/system type?) have sub= mitted >> the message with a delta so similar would lead me to believe there is = some >> timer that is =B3wrapping=B2. And since pvclock is usually the clocks= ource for >> dom0, and pvclock is driven! by Xen=B9s =B3system time=B2, a reasonab= le guess is >> that the timer that is wrapping is in Xen itself. >> >> Mark=B9s delta =3D -2999660303788 ns >> Your delta =3D -2999660334211 ns >> >> Googling, I see the HPET wraparound is ~306 seconds and this delta is = about >> 3000 seconds, so that may be a bad guess. >> >> Keir, any thoughts on this? Do you recall any post-4.0 patches that m= ay have >> fixed this? > I've never seen a 3000s wrap, and I don't know of anything that would h= ave > fixed a bug like this. If this is a Xen time wrap of some kind then it = would > affect all running guests; it's not clear here whether only one, or all= , > guests see the wrap. > > K. > >> Thanks, >> Dan >> >> References: >> http://lists.xensource.com/archives/html/xen-devel/2010-10/msg00210.ht= ml >> https://lkml.org/lkml/2010/10/26/126 >> >> >> From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com] >> Sent: Wednesday, February 23, 2011 3:50 AM >> To: xen-devel@lists.xensource.co! m; Xen Users >> Subject: [Xen-devel] Xen 4 TSC problems >> >> >> Hello >> >> >> >> I've got an issue about time keeping with Xen 4.0 (Debian squeeze rele= ase). >> >> >> >> My problem is here (hopefully I amn't the only one, so there might be = a bug >> somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161= #50 >> >> After some times, I got this error : Clocksource tsc unstable (delta = =3D >> -2999660334211 ns). It has happened on several servers. >> >> >> >> Looking at the output of "xm debug-key s;" >> >> >> >> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, w= arp=3D2850 >> (count=3D3) >> >> >> >> I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the >> "constant_tsc", but not the "nonstop_tsc" one. >> >> On other systems with a newer cpu with "nonstop_tsc", I don't have thi= s issue >> (systems are running the same distros with same config). >> >> >> >> I tried to boot with "max_cstate=3D0", but nothing changed, my TSC isn= 't >> reliable and after some times, I will got the "50min" issue again. >> >> >> >> I don't unders! tand how a system can do a jump of "50min" in the fut= ure. Why >> 50min ? it is not 40min, not 1 hour, it is always 50min. >> >> I don't know how to make my TSC "reliable" (I already disable everythi= ng about >> Powerstate in BIOS Settings). >> >> >> >> Any ideas ? >> >> >> >> Regards >> >> >> >> Olivier >> > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: Xen 4 TSC problems Date: Thu, 24 Feb 2011 07:16:05 +0000 Message-ID: References: <4D655A39.2040501@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <4D655A39.2040501@gmail.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Xen Users , Jeremy Fitzhardinge , Mark Adams List-Id: xen-devel@lists.xenproject.org Please send Xen boot output (xm dmesg). Getting it from Xen 3.2 as well would be interesting, if you still have it installed on any of these machines. -- Keir On 23/02/2011 19:04, "Olivier Hanesse" wrote: > I am sorry for the lack of information. > Every domUs on the dom0 are affected by this bug at the exact same time. >=20 > And I had this bug on a dozen servers (all running on the same hw) since > October (when I switched from Xen 3.2 to 4.0). >=20 > Regards >=20 > Olivier >=20 > Le 23/02/2011 18:19, Keir Fraser a =E9crit : >> On 23/02/2011 16:16, "Dan Magenheimer" wrot= e: >>=20 >>> It=B9s very unlikely this is a problem with TSC. It is most likely a Xen = (or >>> possibly a PV Linux) problem where a guest (or dom0) either =B3goes out t= o >>> lunch=B2 for a long period, or some other timer gets stuck. The =B3clockso= urce >>> tsc unstable=B2 message is a side effect of this... it=B9s very likely the = TSC >>> that IS stable and correct and the other clocksource (pvclock) has >>> lost/gained >>> 50 minutes! >>>=20 >>> Mark Adams cc=B9ed and his original xen-devel posting below. The fact th= at >>> two >>> different users (possibly on the same processor/system type?) have subm= itted >>> the message with a delta so similar would lead me to believe there is s= ome >>> timer that is =B3wrapping=B2. And since pvclock is usually the clocksource= for >>> dom0, and pvclock is driven! by Xen=B9s =B3system time=B2, a reasonable gues= s is >>> that the timer that is wrapping is in Xen itself. >>>=20 >>> Mark=B9s delta =3D -2999660303788 ns >>> Your delta =3D -2999660334211 ns >>>=20 >>> Googling, I see the HPET wraparound is ~306 seconds and this delta is a= bout >>> 3000 seconds, so that may be a bad guess. >>>=20 >>> Keir, any thoughts on this? Do you recall any post-4.0 patches that ma= y >>> have >>> fixed this? >> I've never seen a 3000s wrap, and I don't know of anything that would ha= ve >> fixed a bug like this. If this is a Xen time wrap of some kind then it w= ould >> affect all running guests; it's not clear here whether only one, or all, >> guests see the wrap. >>=20 >> K. >>=20 >>> Thanks, >>> Dan >>>=20 >>> References: >>> http://lists.xensource.com/archives/html/xen-devel/2010-10/msg00210.htm= l >>> https://lkml.org/lkml/2010/10/26/126 >>>=20 >>>=20 >>> From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com] >>> Sent: Wednesday, February 23, 2011 3:50 AM >>> To: xen-devel@lists.xensource.co! m; Xen Users >>> Subject: [Xen-devel] Xen 4 TSC problems >>>=20 >>>=20 >>> Hello >>>=20 >>>=20 >>>=20 >>> I've got an issue about time keeping with Xen 4.0 (Debian squeeze relea= se). >>>=20 >>>=20 >>>=20 >>> My problem is here (hopefully I amn't the only one, so there might be a= bug >>> somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161#50 >>>=20 >>> After some times, I got this error : Clocksource tsc unstable (delta =3D >>> -2999660334211 ns). It has happened on several servers. >>>=20 >>>=20 >>>=20 >>> Looking at the output of "xm debug-key s;" >>>=20 >>>=20 >>>=20 >>> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, >>> warp=3D2850 >>> (count=3D3) >>>=20 >>>=20 >>>=20 >>> I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the >>> "constant_tsc", but not the "nonstop_tsc" one. >>>=20 >>> On other systems with a newer cpu with "nonstop_tsc", I don't have this >>> issue >>> (systems are running the same distros with same config). >>>=20 >>>=20 >>>=20 >>> I tried to boot with "max_cstate=3D0", but nothing changed, my TSC isn't >>> reliable and after some times, I will got the "50min" issue again. >>>=20 >>>=20 >>>=20 >>> I don't unders! tand how a system can do a jump of "50min" in the futu= re. >>> Why >>> 50min ? it is not 40min, not 1 hour, it is always 50min. >>>=20 >>> I don't know how to make my TSC "reliable" (I already disable everythin= g >>> about >>> Powerstate in BIOS Settings). >>>=20 >>>=20 >>>=20 >>> Any ideas ? >>>=20 >>>=20 >>>=20 >>> Regards >>>=20 >>>=20 >>>=20 >>> Olivier >>>=20 >>=20 >=20 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: Xen 4 TSC problems Date: Thu, 24 Feb 2011 10:59:05 +0100 Message-ID: References: <4D655A39.2040501@gmail.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1147676405==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Keir Fraser Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Xen Users , Jeremy Fitzhardinge , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1147676405== Content-Type: multipart/alternative; boundary=0015175caae4859f9a049d0443f3 --0015175caae4859f9a049d0443f3 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable xm dmesg : (XEN) Xen version 4.0.1 (Debian 4.0.1-2) (waldi@debian.org) (gcc version 4.4.5 (Debian 4.4.5-10) ) Wed Jan 12 14:04:06 UTC 2011 (XEN) Bootloader: GNU GRUB 0.97 (XEN) Command line: dom0_mem=3D512M loglvl=3Dall guest_loglvl=3Dall dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 (XEN) Video information: (XEN) VGA is text mode 80x25, font 8x16 (XEN) VBE/DDC methods: none; EDID transfer time: 2 seconds (XEN) EDID info not retrieved because no DDC retrieval method detected (XEN) Disc information: (XEN) Found 2 MBR signatures (XEN) Found 2 EDD information structures (XEN) Xen-e820 RAM map: (XEN) 0000000000000000 - 000000000009ac00 (usable) (XEN) 000000000009ac00 - 00000000000a0000 (reserved) (XEN) 00000000000e0000 - 0000000000100000 (reserved) (XEN) 0000000000100000 - 00000000bffc7980 (usable) (XEN) 00000000bffc7980 - 00000000bffcee80 (ACPI data) (XEN) 00000000bffcee80 - 00000000c0000000 (reserved) (XEN) 00000000e0000000 - 00000000f0000000 (reserved) (XEN) 00000000fec00000 - 0000000100000000 (reserved) (XEN) 0000000100000000 - 00000002c0000000 (usable) (XEN) ACPI: RSDP 000FDFD0, 0024 (r2 IBM ) (XEN) ACPI: XSDT BFFCED40, 0054 (r1 IBM SERDEFNT 1000 IBM 45444F43) (XEN) ACPI: FACP BFFCEC80, 0084 (r2 IBM SERDEFNT 1000 IBM 45444F43) (XEN) ACPI: DSDT BFFC7980, 2EDA (r2 IBM SERDEFNT 1000 INTL 20041203) (XEN) ACPI: FACS BFFCAB00, 0040 (XEN) ACPI: APIC BFFCEB80, 00BC (r1 IBM SERDEFNT 1000 IBM 45444F43) (XEN) ACPI: SRAT BFFCEA00, 0128 (r1 IBM SERDEFNT 1000 IBM 45444F43) (XEN) ACPI: HPET BFFCE9C0, 0038 (r1 IBM SERDEFNT 1000 IBM 45444F43) (XEN) ACPI: MCFG BFFCE980, 003C (r1 IBM SERDEFNT 1000 IBM 45444F43) (XEN) ACPI: ERST BFFCAB40, 0230 (r1 IBM SERDEFNT 1000 IBM 45444F43) (XEN) System RAM: 10239MB (10485124kB) (XEN) SRAT: PXM 0 -> APIC 0 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 1 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 2 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 3 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 4 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 5 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 6 -> Node 0 (XEN) SRAT: PXM 0 -> APIC 7 -> Node 0 (XEN) SRAT: Node 0 PXM 0 0-c0000000 (XEN) SRAT: Node 0 PXM 0 100000000-2c0000000 (XEN) SRAT: hot plug zone found 2c0000000 - 1000000000 (XEN) SRAT: Node 0 PXM 0 2c0000000-1000000000 (XEN) NUMA: Allocated memnodemap from 2bfdfe000 - 2bfdff000 (XEN) NUMA: Using 18 for the hash shift. (XEN) Domain heap initialised (XEN) found SMP MP-table at 0009ad40 (XEN) DMI 2.4 present. (XEN) Using APIC driver default (XEN) ACPI: PM-Timer IO Port: 0x588 (XEN) ACPI: ACPI SLEEP INFO: pm1x_cnt[584,0], pm1x_evt[580,0] (XEN) ACPI: wakeup_vec[bffcab0c], vec_size[20] (XEN) ACPI: Local APIC address 0xfee00000 (XEN) ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled) (XEN) Processor #0 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled) (XEN) Processor #1 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x02] lapic_id[0x02] enabled) (XEN) Processor #2 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x03] lapic_id[0x03] enabled) (XEN) Processor #3 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x04] lapic_id[0x04] enabled) (XEN) Processor #4 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x05] lapic_id[0x05] enabled) (XEN) Processor #5 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x06] lapic_id[0x06] enabled) (XEN) Processor #6 7:7 APIC version 20 (XEN) ACPI: LAPIC (acpi_id[0x07] lapic_id[0x07] enabled) (XEN) Processor #7 7:7 APIC version 20 (XEN) ACPI: LAPIC_NMI (acpi_id[0x00] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x01] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x02] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x03] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x04] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x05] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x06] dfl dfl lint[0x1]) (XEN) ACPI: LAPIC_NMI (acpi_id[0x07] dfl dfl lint[0x1]) (XEN) ACPI: IOAPIC (id[0x0e] address[0xfec00000] gsi_base[0]) (XEN) IOAPIC[0]: apic_id 14, version 32, address 0xfec00000, GSI 0-23 (XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl) (XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level) (XEN) ACPI: IRQ0 used by override. (XEN) ACPI: IRQ2 used by override. (XEN) ACPI: IRQ9 used by override. (XEN) Enabling APIC mode: Flat. Using 1 I/O APICs (XEN) ACPI: HPET id: 0x8086a201 base: 0xfed00000 (XEN) PCI: MCFG configuration 0: base e0000000 segment 0 buses 0 - 20 (XEN) PCI: MCFG area at e0000000 reserved in E820 (XEN) Using ACPI (MADT) for SMP configuration information (XEN) Using scheduler: SMP Credit Scheduler (credit) (XEN) Detected 2493.798 MHz processor. (XEN) Initing memory sharing. (XEN) VMX: Supported advanced features: (XEN) - APIC MMIO access virtualisation (XEN) - APIC TPR shadow (XEN) - Virtual NMI (XEN) - MSR direct-access bitmap (XEN) HVM: ASIDs disabled. (XEN) HVM: VMX enabled (XEN) Intel machine check reporting enabled (XEN) I/O virtualisation disabled (XEN) Total of 8 processors activated. (XEN) ENABLING IO-APIC IRQs (XEN) -> Using new ACK method (XEN) ..TIMER: vector=3D0xF0 apic1=3D0 pin1=3D2 apic2=3D-1 pin2=3D-1 (XEN) checking TSC synchronization across 8 CPUs: passed. (XEN) Platform timer is 14.318MHz HPET (XEN) Allocated console ring of 64 KiB. (XEN) microcode.c:73:d32767 microcode: CPU2 resumed (XEN) microcode.c:73:d32767 microcode: CPU1 resumed (XEN) microcode.c:73:d32767 microcode: CPU3 resumed (XEN) Brought up 8 CPUs (XEN) microcode.c:73:d32767 microcode: CPU4 resumed (XEN) microcode.c:73:d32767 microcode: CPU5 resumed (XEN) microcode.c:73:d32767 microcode: CPU6 resumed (XEN) microcode.c:73:d32767 microcode: CPU7 resumed (XEN) HPET: 3 timers in total, 0 timers will be used for broadcast (XEN) ACPI sleep modes: S3 (XEN) mcheck_poll: Machine check polling timer started. (XEN) *** LOADING DOMAIN 0 *** (XEN) Xen kernel: 64-bit, lsb, compat32 (XEN) Dom0 kernel: 64-bit, PAE, lsb, paddr 0x1000000 -> 0x16b2000 (XEN) PHYSICAL MEMORY ARRANGEMENT: (XEN) Dom0 alloc.: 00000002b4000000->00000002b8000000 (114688 pages to b= e allocated) (XEN) VIRTUAL MEMORY ARRANGEMENT: (XEN) Loaded kernel: ffffffff81000000->ffffffff816b2000 (XEN) Init. ramdisk: ffffffff816b2000->ffffffff82e05400 (XEN) Phys-Mach map: ffffffff82e06000->ffffffff82f06000 (XEN) Start info: ffffffff82f06000->ffffffff82f064b4 (XEN) Page tables: ffffffff82f07000->ffffffff82f22000 (XEN) Boot stack: ffffffff82f22000->ffffffff82f23000 (XEN) TOTAL: ffffffff80000000->ffffffff83000000 (XEN) ENTRY ADDRESS: ffffffff81502200 (XEN) Dom0 has maximum 1 VCPUs (XEN) Scrubbing Free RAM: ...........................................................................= .....................done. (XEN) trace.c:89:d32767 calc_tinfo_first_offset: NR_CPUs 128, offset_in_bytes 258, t_info_first_offset 65 (XEN) Xen trace buffers: disabled (XEN) Std. Loglevel: All (XEN) Guest Loglevel: All (XEN) Xen is relinquishing VGA console. (XEN) *** Serial input -> DOM0 (type 'CTRL-a' three times to switch input t= o Xen) (XEN) Freed 176kB init memory. (XEN) PCI add device 00:00.0 (XEN) PCI add device 00:02.0 (XEN) PCI add device 00:03.0 (XEN) PCI add device 00:04.0 (XEN) PCI add device 00:05.0 (XEN) PCI add device 00:06.0 (XEN) PCI add device 00:07.0 (XEN) PCI add device 00:08.0 (XEN) PCI add device 00:10.0 (XEN) PCI add device 00:10.1 (XEN) PCI add device 00:10.2 (XEN) PCI add device 00:11.0 (XEN) PCI add device 00:13.0 (XEN) PCI add device 00:15.0 (XEN) PCI add device 00:16.0 (XEN) PCI add device 00:1c.0 (XEN) PCI add device 00:1d.0 (XEN) PCI add device 00:1d.1 (XEN) PCI add device 00:1d.2 (XEN) PCI add device 00:1d.7 (XEN) PCI add device 00:1e.0 (XEN) PCI add device 00:1f.0 (XEN) PCI add device 00:1f.1 (XEN) PCI add device 00:1f.3 (XEN) PCI add device 10:00.0 (XEN) PCI add device 10:00.3 (XEN) PCI add device 11:00.0 (XEN) PCI add device 11:01.0 (XEN) PCI add device 07:00.0 (XEN) PCI add device 07:00.1 (XEN) PCI add device 03:00.0 (XEN) PCI add device 04:00.0 (XEN) PCI add device 02:00.0 (XEN) PCI add device 05:00.0 (XEN) PCI add device 06:00.0 (XEN) PCI add device 01:01.0 When the issue append : (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. Output of xm debug-key s : (XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp=3D2684 (count=3D4) (XEN) dom1: mode=3D0,ofs=3D0xa8dcbfb9a,khz=3D2493798,inc=3D1,vtsc count: 17= 56100739 kernel, 20526533 user (XEN) dom2: mode=3D0,ofs=3D0xc257d49df,khz=3D2493798,inc=3D1,vtsc count: 90= 0668266 kernel, 30618121 user (XEN) dom3: mode=3D0,ofs=3D0xdb1299744,khz=3D2493798,inc=3D1,vtsc count: 16= 656509047 kernel, 709406217 user (XEN) dom4: mode=3D0,ofs=3D0xf8627e616,khz=3D2493798,inc=3D1,vtsc count: 11= 74828915 kernel, 194957775 user (XEN) dom5: mode=3D0,ofs=3D0x115a0f2a67,khz=3D2493798,inc=3D1,vtsc count: 3= 32007967 kernel, 5766769 user (XEN) dom6: mode=3D0,ofs=3D0x13bf462f38,khz=3D2493798,inc=3D1,vtsc count: 3= 137076938 kernel, 1076320679 user (XEN) dom10: mode=3D0,ofs=3D0x1b99e41f4b,khz=3D2493798,inc=3D1,vtsc count: = 411433049 kernel, 19532319 user (XEN) dom11: mode=3D0,ofs=3D0x1e4991cf40,khz=3D2493798,inc=3D1,vtsc count: = 415406148 kernel, 19223482 user (XEN) dom12: mode=3D0,ofs=3D0x1fe8c10600,khz=3D2493798,inc=3D1,vtsc count: 1012850399 kernel, 63603352 user (XEN) dom13: mode=3D0,ofs=3D0x21ef9b9531,khz=3D2493798,inc=3D1,vtsc count: = 813097186 kernel, 27536004 user (XEN) dom14: mode=3D0,ofs=3D0x23f5b4e429,khz=3D2493798,inc=3D1,vtsc count: 2461059718 kernel, 48182776 user (XEN) dom18: mode=3D0,ofs=3D0x2bdc302048,khz=3D2493798,inc=3D1,vtsc count: = 624333824 kernel, 5166805 user (XEN) dom19: mode=3D0,ofs=3D0x2e67227085,khz=3D2493798,inc=3D1,vtsc count: 1037952789 kernel, 5778635 user (XEN) dom20: mode=3D0,ofs=3D0x562ce020eea4,khz=3D2493798,inc=3D1,vtsc count= : 643491360 kernel, 31771029 user (XEN) dom21: mode=3D0,ofs=3D0x563a017eea82,khz=3D2493798,inc=3D1,vtsc count= : 715148727 kernel, 24430809 user (XEN) dom25: mode=3D0,ofs=3D0x1d0c5230cdfad,khz=3D2493798,inc=3D1,vtsc coun= t: 2103227324 kernel, 656635140 user (XEN) dom27: mode=3D0,ofs=3D0x1d868b8c1fbbf,khz=3D2493798,inc=3D1,vtsc coun= t: 476542178 kernel, 12976786 user (XEN) dom31: mode=3D0,ofs=3D0x1dc08da161ebc,khz=3D2493798,inc=3D1,vtsc coun= t: 2747233178 kernel, 466863700 user (XEN) dom32: mode=3D0,ofs=3D0x1ecde6eb53d2c,khz=3D2493798,inc=3D1,vtsc coun= t: 305360096 kernel, 11705823 user (XEN) dom33: mode=3D0,ofs=3D0x1ece1bf734f61,khz=3D2493798,inc=3D1,vtsc coun= t: 516548852 kernel, 18662125 user Output of xm debug-key t : (XEN) Synced stime skew: max=3D1405ns avg=3D1405ns samples=3D1 current=3D14= 05ns (XEN) Synced cycles skew: max=3D2377 avg=3D2377 samples=3D1 current=3D2377 Output of /proc/cpuinfo : processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 23 model name : Intel(R) Xeon(R) CPU L5420 @ 2.50GHz stepping : 6 cpu MHz : 2493.798 cache size : 6144 KB fpu : yes fpu_exception : yes cpuid level : 10 wp : yes flags : fpu de tsc msr pae mce cx8 apic sep mtrr mca cmov pat clflush acpi mmx fxsr sse sse2 ss ht syscall lm constant_tsc up rep_good aperfmperf pni est ssse3 cx16 sse4_1 hypervisor lahf_lm bogomips : 4987.59 clflush size : 64 cache_alignment : 64 address sizes : 38 bits physical, 48 bits virtual power management: Output of xm info : release : 2.6.32-bpo.5-xen-amd64 version : #1 SMP Mon Jan 17 22:05:11 UTC 2011 machine : x86_64 nr_cpus : 8 nr_nodes : 1 cores_per_socket : 4 threads_per_core : 1 cpu_mhz : 2493 hw_caps : bfebfbff:20000800:00000000:00000940:000ce3bd:00000000:00000001:00000000 virt_caps : hvm total_memory : 10239 free_memory : 910 node_to_cpu : node0:0-7 node_to_memory : node0:910 node_to_dma32_mem : node0:910 max_node_id : 0 xen_major : 4 xen_minor : 0 xen_extra : .1 xen_caps : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 hvm-3.0-x86_32p hvm-3.0-x86_64 xen_scheduler : credit xen_pagesize : 4096 platform_params : virt_start=3D0xffff800000000000 xen_changeset : unavailable xen_commandline : dom0_mem=3D512M loglvl=3Dall guest_loglvl=3Dall dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 cc_compiler : gcc version 4.4.5 (Debian 4.4.5-10) cc_compile_by : waldi cc_compile_domain : debian.org cc_compile_date : Wed Jan 12 14:04:06 UTC 2011 xend_config_format : 4 in dom0 /var/log/kern.log : Feb 23 22:40:54 dom0 kernel: [995452.618519] Clocksource tsc unstable (delt= a =3D -2999660335950 ns) in domU, I don't see any logs, the time just "jumps" 50min in the future (see /var/log/daemon.log) Feb 23 21:50:51 domU snmpd[1037]: Connection from UDP: [10.16.2.101]:58303 Feb 23 22:40:55 domU snmpd[1037]: Connection from UDP: [10.16.2.101]:45713 Clocksource is set to "xen" to both dom0 et domU : cat /sys/devices/system/clocksource/clocksource0/current_clocksource Regards Olivier 2011/2/24 Keir Fraser > Please send Xen boot output (xm dmesg). Getting it from Xen 3.2 as well > would be interesting, if you still have it installed on any of these > machines. > > -- Keir > > On 23/02/2011 19:04, "Olivier Hanesse" wrote: > > > I am sorry for the lack of information. > > Every domUs on the dom0 are affected by this bug at the exact same time= . > > > > And I had this bug on a dozen servers (all running on the same hw) sinc= e > > October (when I switched from Xen 3.2 to 4.0). > > > > Regards > > > > Olivier > > > > Le 23/02/2011 18:19, Keir Fraser a =E9crit : > >> On 23/02/2011 16:16, "Dan Magenheimer" > wrote: > >> > >>> It=B9s very unlikely this is a problem with TSC. It is most likely a = Xen > (or > >>> possibly a PV Linux) problem where a guest (or dom0) either =B3goes o= ut > to > >>> lunch=B2 for a long period, or some other timer gets stuck. The > =B3clocksource > >>> tsc unstable=B2 message is a side effect of this... it=B9s very likel= y the > TSC > >>> that IS stable and correct and the other clocksource (pvclock) has > >>> lost/gained > >>> 50 minutes! > >>> > >>> Mark Adams cc=B9ed and his original xen-devel posting below. The fac= t > that > >>> two > >>> different users (possibly on the same processor/system type?) have > submitted > >>> the message with a delta so similar would lead me to believe there is > some > >>> timer that is =B3wrapping=B2. And since pvclock is usually the clock= source > for > >>> dom0, and pvclock is driven! by Xen=B9s =B3system time=B2, a reasona= ble > guess is > >>> that the timer that is wrapping is in Xen itself. > >>> > >>> Mark=B9s delta =3D -2999660303788 ns > >>> Your delta =3D -2999660334211 ns > >>> > >>> Googling, I see the HPET wraparound is ~306 seconds and this delta is > about > >>> 3000 seconds, so that may be a bad guess. > >>> > >>> Keir, any thoughts on this? Do you recall any post-4.0 patches that > may > >>> have > >>> fixed this? > >> I've never seen a 3000s wrap, and I don't know of anything that would > have > >> fixed a bug like this. If this is a Xen time wrap of some kind then it > would > >> affect all running guests; it's not clear here whether only one, or al= l, > >> guests see the wrap. > >> > >> K. > >> > >>> Thanks, > >>> Dan > >>> > >>> References: > >>> > http://lists.xensource.com/archives/html/xen-devel/2010-10/msg00210.html > >>> https://lkml.org/lkml/2010/10/26/126 > >>> > >>> > >>> From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com] > >>> Sent: Wednesday, February 23, 2011 3:50 AM > >>> To: xen-devel@lists.xensource.co! m; Xen Users > >>> Subject: [Xen-devel] Xen 4 TSC problems > >>> > >>> > >>> Hello > >>> > >>> > >>> > >>> I've got an issue about time keeping with Xen 4.0 (Debian squeeze > release). > >>> > >>> > >>> > >>> My problem is here (hopefully I amn't the only one, so there might be= a > bug > >>> somewhere) : > http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161#50 > >>> > >>> After some times, I got this error : Clocksource tsc unstable (delta= =3D > >>> -2999660334211 ns). It has happened on several servers. > >>> > >>> > >>> > >>> Looking at the output of "xm debug-key s;" > >>> > >>> > >>> > >>> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > >>> warp=3D2850 > >>> (count=3D3) > >>> > >>> > >>> > >>> I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the > >>> "constant_tsc", but not the "nonstop_tsc" one. > >>> > >>> On other systems with a newer cpu with "nonstop_tsc", I don't have th= is > >>> issue > >>> (systems are running the same distros with same config). > >>> > >>> > >>> > >>> I tried to boot with "max_cstate=3D0", but nothing changed, my TSC is= n't > >>> reliable and after some times, I will got the "50min" issue again. > >>> > >>> > >>> > >>> I don't unders! tand how a system can do a jump of "50min" in the > future. > >>> Why > >>> 50min ? it is not 40min, not 1 hour, it is always 50min. > >>> > >>> I don't know how to make my TSC "reliable" (I already disable > everything > >>> about > >>> Powerstate in BIOS Settings). > >>> > >>> > >>> > >>> Any ideas ? > >>> > >>> > >>> > >>> Regards > >>> > >>> > >>> > >>> Olivier > >>> > >> > > > > > --0015175caae4859f9a049d0443f3 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
xm dmesg :=A0

(XEN) =A0Found 2 EDD information structures
(XEN) Xen-e820 R= AM map:
(XEN) =A00000000000000000 - 000000000009ac00 (usable)
(XEN) =A0000000000009ac00 - 00000000000a0000 (reserved)
(X= EN) =A000000000000e0000 - 0000000000100000 (reserved)
(XEN) =A00000000000100000 - 00000000bffc7980 (usable)
(XEN) = =A000000000bffc7980 - 00000000bffcee80 (ACPI data)
(XEN) =A000000= 000bffcee80 - 00000000c0000000 (reserved)
(XEN) =A000000000e00000= 00 - 00000000f0000000 (reserved)
(XEN) =A000000000fec00000 - 0000000100000000 (reserved)
(XEN= ) =A00000000100000000 - 00000002c0000000 (usable)
(XEN) ACPI: RSD= P 000FDFD0, 0024 (r2 IBM =A0 )
(XEN) ACPI: XSDT BFFCED40, 0054 (r= 1 IBM =A0 =A0SERDEFNT =A0 =A0 1000 IBM =A045444F43)
(XEN) ACPI: FACP BFFCEC80, 0084 (r2 IBM =A0 =A0SERDEFNT =A0 =A0 1000 I= BM =A045444F43)
(XEN) ACPI: DSDT BFFC7980, 2EDA (r2 IBM =A0 =A0SE= RDEFNT =A0 =A0 1000 INTL 20041203)
(XEN) ACPI: FACS BFFCAB00, 004= 0
(XEN) ACPI: APIC BFFCEB80, 00BC (r1 IBM =A0 =A0SERDEFNT =A0 =A0= 1000 IBM =A045444F43)
(XEN) ACPI: SRAT BFFCEA00, 0128 (r1 IBM =A0 =A0SERDEFNT =A0 =A0 1000 I= BM =A045444F43)
(XEN) ACPI: HPET BFFCE9C0, 0038 (r1 IBM =A0 =A0SE= RDEFNT =A0 =A0 1000 IBM =A045444F43)
(XEN) ACPI: MCFG BFFCE980, 0= 03C (r1 IBM =A0 =A0SERDEFNT =A0 =A0 1000 IBM =A045444F43)
(XEN) ACPI: ERST BFFCAB40, 0230 (r1 IBM =A0 =A0SERDEFNT =A0 =A0 1000 I= BM =A045444F43)
(XEN) System RAM: 10239MB (10485124kB)
= (XEN) SRAT: PXM 0 -> APIC 0 -> Node 0
(XEN) SRAT: PXM 0 -&g= t; APIC 1 -> Node 0
(XEN) SRAT: PXM 0 -> APIC 2 -> Node 0
(XEN) SRAT: PXM = 0 -> APIC 3 -> Node 0
(XEN) SRAT: PXM 0 -> APIC 4 -> = Node 0
(XEN) SRAT: PXM 0 -> APIC 5 -> Node 0
(XEN) SRAT: PXM 0 -> APIC 6 -> Node 0
(XEN) SRAT: PXM 0 -&g= t; APIC 7 -> Node 0
(XEN) SRAT: Node 0 PXM 0 0-c0000000
<= div>(XEN) SRAT: Node 0 PXM 0 100000000-2c0000000
(XEN) SRAT: hot = plug zone found 2c0000000 - 1000000000=A0
(XEN) SRAT: Node 0 PXM 0 2c0000000-1000000000
(XEN) NUMA: Al= located memnodemap from 2bfdfe000 - 2bfdff000
(XEN) NUMA: Using 1= 8 for the hash shift.
(XEN) Domain heap initialised
(XEN) found SMP MP-table at 0009ad40
(XEN) DMI 2.4 present.
=
(XEN) Using APIC driver default
(XEN) ACPI: PM-Timer IO Port= : 0x588
(XEN) ACPI: ACPI SLEEP INFO: pm1x_cnt[584,0], pm1x_evt[58= 0,0]
(XEN) ACPI: =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0wakeup_vec[bffcab0c], v= ec_size[20]
(XEN) ACPI: Local APIC address 0xfee00000
(= XEN) ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
(XEN) Pro= cessor #0 7:7 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x01] lapic_id[0x01] enabled)
(XE= N) Processor #1 7:7 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x= 02] lapic_id[0x02] enabled)
(XEN) Processor #2 7:7 APIC version 2= 0
(XEN) ACPI: LAPIC (acpi_id[0x03] lapic_id[0x03] enabled)
(XE= N) Processor #3 7:7 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x= 04] lapic_id[0x04] enabled)
(XEN) Processor #4 7:7 APIC version 2= 0
(XEN) ACPI: LAPIC (acpi_id[0x05] lapic_id[0x05] enabled)
(XE= N) Processor #5 7:7 APIC version 20
(XEN) ACPI: LAPIC (acpi_id[0x= 06] lapic_id[0x06] enabled)
(XEN) Processor #6 7:7 APIC version 2= 0
(XEN) ACPI: LAPIC (acpi_id[0x07] lapic_id[0x07] enabled)
(XE= N) Processor #7 7:7 APIC version 20
(XEN) ACPI: LAPIC_NMI (acpi_i= d[0x00] dfl dfl lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x01] d= fl dfl lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x02] dfl dfl lint[0x1])
(XEN= ) ACPI: LAPIC_NMI (acpi_id[0x03] dfl dfl lint[0x1])
(XEN) ACPI: L= APIC_NMI (acpi_id[0x04] dfl dfl lint[0x1])
(XEN) ACPI: LAPIC_NMI = (acpi_id[0x05] dfl dfl lint[0x1])
(XEN) ACPI: LAPIC_NMI (acpi_id[0x06] dfl dfl lint[0x1])
(XEN= ) ACPI: LAPIC_NMI (acpi_id[0x07] dfl dfl lint[0x1])
(XEN) ACPI: I= OAPIC (id[0x0e] address[0xfec00000] gsi_base[0])
(XEN) IOAPIC[0]:= apic_id 14, version 32, address 0xfec00000, GSI 0-23
(XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
(XEN) ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 high level)
<= div>(XEN) ACPI: IRQ0 used by override.
(XEN) ACPI: IRQ9 used by override.
(XEN) Enabling APIC mode:= =A0Flat. =A0Using 1 I/O APICs
(XEN) ACPI: HPET id: 0x8086a201 ba= se: 0xfed00000
(XEN) PCI: MCFG configuration 0: base e0000000 seg= ment 0 buses 0 - 20
(XEN) PCI: MCFG area at e0000000 reserved in E820
(XEN) Usin= g ACPI (MADT) for SMP configuration information
(XEN) Using sched= uler: SMP Credit Scheduler (credit)
(XEN) Detected 2493.798 MHz p= rocessor.
(XEN) Initing memory sharing.
(XEN) VMX: Supported advanced = features:
(XEN) =A0- APIC MMIO access virtualisation
(X= EN) =A0- APIC TPR shadow
(XEN) =A0- Virtual NMI
(XEN) = =A0- MSR direct-access bitmap
(XEN) HVM: ASIDs disabled.
(XEN) HVM: VMX enabled
= (XEN) Intel machine check reporting enabled
(XEN) I/O virtualisat= ion disabled
(XEN) Total of 8 processors activated.
(XEN) ENABLING IO-APIC IRQs
(XEN) =A0-> Using new ACK method
(XEN) ..TIMER: vector=3D0xF0 apic1=3D0 pin1=3D2 apic2=3D-1 pin2=3D= -1
(XEN) checking TSC synchronization across 8 CPUs: passed.
(XEN) Platform timer is 14.318MHz HPET
(XEN) Allocated console ring of 64 KiB.
(XEN) microcode.c:73= :d32767 microcode: CPU2 resumed
(XEN) microcode.c:73:d32767 micro= code: CPU1 resumed
(XEN) microcode.c:73:d32767 microcode: CPU3 re= sumed
(XEN) Brought up 8 CPUs
(XEN) microcode.c:73:d32767 microcod= e: CPU4 resumed
(XEN) microcode.c:73:d32767 microcode: CPU5 resum= ed
(XEN) microcode.c:73:d32767 microcode: CPU6 resumed
(XEN) microcode.c:73:d32767 microcode: CPU7 resumed
(XEN) HPET: 3= timers in total, 0 timers will be used for broadcast
(XEN) ACPI = sleep modes: S3
(XEN) mcheck_poll: Machine check polling timer st= arted.
(XEN) *** LOADING DOMAIN 0 ***
(XEN) =A0Xen =A0kernel: 64-bi= t, lsb, compat32
(XEN) =A0Dom0 kernel: 64-bit, PAE, lsb, paddr 0x= 1000000 -> 0x16b2000
(XEN) PHYSICAL MEMORY ARRANGEMENT:
<= div> (XEN) =A0Dom0 alloc.: =A0 00000002b4000000->00000002b8000000 (114688 pag= es to be allocated)
(XEN) VIRTUAL MEMORY ARRANGEMENT:
(= XEN) =A0Loaded kernel: ffffffff81000000->ffffffff816b2000
(XEN= ) =A0Init. ramdisk: ffffffff816b2000->ffffffff82e05400
(XEN) =A0Phys-Mach map: ffffffff82e06000->ffffffff82f06000
(XEN) =A0Start info: =A0 =A0ffffffff82f06000->ffffffff82f064b4
(XEN) =A0Page tables: =A0 ffffffff82f07000->ffffffff82f22000
(XEN) =A0Boot stack: =A0 =A0ffffffff82f22000->ffffffff82f23000
(XEN) =A0TOTAL: =A0 =A0 =A0 =A0 ffffffff80000000->ffffffff83000000<= /div>
(XEN) =A0ENTRY ADDRESS: ffffffff81502200
(XEN) Dom0 has= maximum 1 VCPUs
(XEN) Scrubbing Free RAM: ......................= ..........................................................................d= one.
(XEN) trace.c:89:d32767 calc_tinfo_first_offset: NR_CPUs 128, offset_i= n_bytes 258, t_info_first_offset 65
(XEN) Xen trace buffers: disa= bled
(XEN) Std. Loglevel: All
(XEN) Guest Loglevel: All=
(XEN) Xen is relinquishing VGA console.
(XEN) *** Serial inp= ut -> DOM0 (type 'CTRL-a' three times to switch input to Xen)
(XEN) Freed 176kB init memory.
(XEN) PCI add device 00:00= .0
(XEN) PCI add device 00:02.0
(XEN) PCI add device 00:03.0
(XEN) PCI add device 00:04.0
(XEN) PCI add device 00:05.0=
(XEN) PCI add device 00:06.0
(XEN) PCI add device 00:0= 7.0
(XEN) PCI add device 00:08.0
(XEN) PCI add device 00:10.0
(XEN) PCI add device 00:10.1
(XEN) PCI add device 00:10.2=
(XEN) PCI add device 00:11.0
(XEN) PCI add device 00:1= 3.0
(XEN) PCI add device 00:15.0
(XEN) PCI add device 00:16.0
(XEN) PCI add device 00:1c.0
(XEN) PCI add device 00:1d.0=
(XEN) PCI add device 00:1d.1
(XEN) PCI add device 00:1= d.2
(XEN) PCI add device 00:1d.7
(XEN) PCI add device 00:1e.0
(XEN) PCI add device 00:1f.0
(XEN) PCI add device 00:1f.1=
(XEN) PCI add device 00:1f.3
(XEN) PCI add device 10:0= 0.0
(XEN) PCI add device 10:00.3
(XEN) PCI add device 11:00.0
(XEN) PCI add device 11:01.0
(XEN) PCI add device 07:00.0=
(XEN) PCI add device 07:00.1
(XEN) PCI add device 03:0= 0.0
(XEN) PCI add device 04:00.0
(XEN) PCI add device 02:00.0
(XEN) PCI add device 05:00.0
(XEN) PCI add device 06:00.0=
(XEN) PCI add device 01:01.0

When the issue append :=A0

(XEN) Platform tim= er appears to have unexpectedly wrapped 10 or more times.
<= br>
Output of xm debug-key s :

(XEN= ) TSC has constant rate, deep Cstates possible, so not reliable, warp=3D268= 4 (count=3D4)
(XEN) dom1: mode=3D0,ofs=3D0xa8dcbfb9a,khz=3D2493798,inc=3D1,vtsc coun= t: 1756100739 kernel, 20526533 user
(XEN) dom2: mode=3D0,ofs=3D0x= c257d49df,khz=3D2493798,inc=3D1,vtsc count: 900668266 kernel, 30618121 user=
(XEN) dom3: mode=3D0,ofs=3D0xdb1299744,khz=3D2493798,inc=3D1,vts= c count: 16656509047 kernel, 709406217 user
(XEN) dom4: mode=3D0,ofs=3D0xf8627e616,khz=3D2493798,inc=3D1,vtsc coun= t: 1174828915 kernel, 194957775 user
(XEN) dom5: mode=3D0,ofs=3D0= x115a0f2a67,khz=3D2493798,inc=3D1,vtsc count: 332007967 kernel, 5766769 use= r
(XEN) dom6: mode=3D0,ofs=3D0x13bf462f38,khz=3D2493798,inc=3D1,vtsc count: 3= 137076938 kernel, 1076320679 user
(XEN) dom10: mode=3D0,ofs=3D0x1= b99e41f4b,khz=3D2493798,inc=3D1,vtsc count: 411433049 kernel, 19532319 user=
(XEN) dom11: mode=3D0,ofs=3D0x1e4991cf40,khz=3D2493798,inc=3D1,v= tsc count: 415406148 kernel, 19223482 user
(XEN) dom12: mode=3D0,ofs=3D0x1fe8c10600,khz=3D2493798,inc=3D1,vtsc co= unt: 1012850399 kernel, 63603352 user
(XEN) dom13: mode=3D0,ofs= =3D0x21ef9b9531,khz=3D2493798,inc=3D1,vtsc count: 813097186 kernel, 2753600= 4 user
(XEN) dom14: mode=3D0,ofs=3D0x23f5b4e429,khz=3D2493798,inc=3D1,vtsc count: = 2461059718 kernel, 48182776 user
(XEN) dom18: mode=3D0,ofs=3D0x2b= dc302048,khz=3D2493798,inc=3D1,vtsc count: 624333824 kernel, 5166805 user
(XEN) dom19: mode=3D0,ofs=3D0x2e67227085,khz=3D2493798,inc=3D1,vts= c count: 1037952789 kernel, 5778635 user
(XEN) dom20: mode=3D0,ofs=3D0x562ce020eea4,khz=3D2493798,inc=3D1,vtsc = count: 643491360 kernel, 31771029 user
(XEN) dom21: mode=3D0,ofs= =3D0x563a017eea82,khz=3D2493798,inc=3D1,vtsc count: 715148727 kernel, 24430= 809 user
(XEN) dom25: mode=3D0,ofs=3D0x1d0c5230cdfad,khz=3D2493798,inc=3D1,vtsc= count: 2103227324 kernel, 656635140 user
(XEN) dom27: mode=3D0,o= fs=3D0x1d868b8c1fbbf,khz=3D2493798,inc=3D1,vtsc count: 476542178 kernel, 12= 976786 user
(XEN) dom31: mode=3D0,ofs=3D0x1dc08da161ebc,khz=3D2493798,inc=3D1,vtsc= count: 2747233178 kernel, 466863700 user
(XEN) dom32: mode=3D0,o= fs=3D0x1ecde6eb53d2c,khz=3D2493798,inc=3D1,vtsc count: 305360096 kernel, 11= 705823 user
(XEN) dom33: mode=3D0,ofs=3D0x1ece1bf734f61,khz=3D2493798,inc=3D1,vtsc= count: 516548852 kernel, 18662125 user


=
Output of xm debug-key t :

(XEN) Synced stime skew: max=3D1405ns avg=3D1405ns= samples=3D1 current=3D1405ns
(XEN) Synced cycles skew: max=3D237= 7 avg=3D2377 samples=3D1 current=3D2377

Outp= ut of /proc/cpuinfo :=A0

processor =A0 =A0 =A0 : 0
vendor_id =A0 = =A0 =A0 : GenuineIntel
cpu family =A0 =A0 =A0: 6
model = =A0 =A0 =A0 =A0 =A0 : 23
model name =A0 =A0 =A0: Intel(R) Xeon(R)= CPU =A0 =A0 =A0 =A0 =A0 L5420 =A0@ 2.50GHz
stepping =A0 =A0 =A0 =A0: 6
cpu MHz =A0 =A0 =A0 =A0 : 2493.7= 98
cache size =A0 =A0 =A0: 6144 KB
fpu =A0 =A0 =A0 =A0 = =A0 =A0 : yes
fpu_exception =A0 : yes
cpuid level =A0 = =A0 : 10
wp =A0 =A0 =A0 =A0 =A0 =A0 =A0: yes
flags =A0 =A0 =A0 =A0 =A0 : fpu de tsc msr pae mce cx8 apic sep mtrr m= ca cmov pat clflush acpi mmx fxsr sse sse2 ss ht syscall lm constant_tsc up= rep_good aperfmperf pni est ssse3 cx16 sse4_1 hypervisor lahf_lm
bogomips =A0 =A0 =A0 =A0: 4987.59
clflush size =A0 =A0: 64
cache_alignment : 64
addr= ess sizes =A0 : 38 bits physical, 48 bits virtual
power managemen= t:

Output of xm info :=A0

release =A0 =A0 =A0 =A0 =A0= =A0 =A0 =A0: 2.6.32-bpo.5-xen-amd64
version =A0 =A0 =A0 =A0 =A0 = =A0 =A0 =A0: #1 SMP Mon Jan 17 22:05:11 UTC 2011
machine =A0 =A0 = =A0 =A0 =A0 =A0 =A0 =A0: x86_64
nr_cpus =A0 =A0 =A0 =A0 =A0 =A0 = =A0 =A0: 8
nr_nodes =A0 =A0 =A0 =A0 =A0 =A0 =A0 : 1
cores_per_socket = =A0 =A0 =A0 : 4
threads_per_core =A0 =A0 =A0 : 1
cpu_mh= z =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0: 2493
hw_caps =A0 =A0 =A0 =A0 = =A0 =A0 =A0 =A0: bfebfbff:20000800:00000000:00000940:000ce3bd:00000000:0000= 0001:00000000
virt_caps =A0 =A0 =A0 =A0 =A0 =A0 =A0: hvm
total_memory =A0 = =A0 =A0 =A0 =A0 : 10239
free_memory =A0 =A0 =A0 =A0 =A0 =A0: 910<= /div>
node_to_cpu =A0 =A0 =A0 =A0 =A0 =A0: node0:0-7
node_to_= memory =A0 =A0 =A0 =A0 : node0:910
node_to_dma32_mem =A0 =A0 =A0:= node0:910
max_node_id =A0 =A0 =A0 =A0 =A0 =A0: 0
xen_major =A0 =A0 =A0= =A0 =A0 =A0 =A0: 4
xen_minor =A0 =A0 =A0 =A0 =A0 =A0 =A0: 0
xen_extra =A0 =A0 =A0 =A0 =A0 =A0 =A0: .1
xen_caps =A0 =A0 = =A0 =A0 =A0 =A0 =A0 : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 hvm-3.0= -x86_32p hvm-3.0-x86_64=A0
xen_scheduler =A0 =A0 =A0 =A0 =A0: credit
xen_pagesize =A0 = =A0 =A0 =A0 =A0 : 4096
platform_params =A0 =A0 =A0 =A0: virt_star= t=3D0xffff800000000000
xen_changeset =A0 =A0 =A0 =A0 =A0: unavail= able
xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M loglvl=3Dal= l guest_loglvl=3Dall dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 c= om1=3D19200,8n1
cc_compiler =A0 =A0 =A0 =A0 =A0 =A0: gcc version 4.4.5 (Debian 4.4.5-1= 0)=A0
cc_compile_by =A0 =A0 =A0 =A0 =A0: waldi
cc_compi= le_domain =A0 =A0 =A0: debian.org
cc_compile_date =A0 =A0 =A0 =A0: Wed Jan 12 14:04:06 UTC 2011
xend_config_format =A0 =A0 : 4

in dom0 = /var/log/kern.log :

Feb 23 22:40:54 dom0 kern= el: [995452.618519] Clocksource tsc unstable (delta =3D -2999660335950 ns)<= /div>

in domU, I don't see any logs, the time just = "jumps" 50min in the future (see /var/log/daemon.log)
<= br>
Feb 23 21:50:51 domU snmpd[1037]: Connection from UDP: [10.16.2.101]:5= 8303
Feb 23 22:40:55 domU snmpd[1037]: Connection from UDP: [10.1= 6.2.101]:45713

Clocksource is set to &= quot;xen" to both dom0 et domU :
cat /sys/devices/system/clocksource/clocksource0/current_clocksource

Regards

Olivier
=
2011/2/24 Keir Fraser <keir.xen@gmail.com>
Please send Xen boot output (xm dmesg). Get= ting it from Xen 3.2 as well
would be interesting, if you still have it installed on any of these
machines.

=A0-- Keir

On 23/02/2011 19:04, "Olivier Hanesse" <olivier.hanesse@gmail.com> wrote:

> I am sorry for the lack of information.
> Every domUs on the dom0 are affected by this bug at the exact same tim= e.
>
> And I had this bug on a dozen servers (all running on the same hw) sin= ce
> October (when I switched from Xen 3.2 to 4.0).
>
> Regards
>
> Olivier
>
> Le 23/02/2011 18:19, Keir Fraser a =E9crit :
>> On 23/02/2011 16:16, "Dan Magenheimer"<dan.magenheimer@oracle.com> =A0wrote= :
>>
>>> It=B9s very unlikely this is a problem with TSC. It is m= ost likely a Xen (or
>>> possibly a PV Linux) problem where a guest (= or dom0) either =B3goes out to
>>> lunch=B2 for a long period, or some other timer gets stu= ck. =A0The =B3clocksource
>>> tsc unstable=B2 message is a side effect of this... it=B9s ver= y likely the TSC
>>> that IS stable and correct and the other clo= cksource (pvclock) has
>>> lost/gained
>>> 50 minutes!
>>>
>>> Mark Adams cc=B9ed and his original xen-devel posting be= low. =A0The fact that
>>> two
>>> different users (possibly on the same processor/system type?) = have submitted
>>> the message with a delta so similar would lead me to believe t= here is some
>>> timer that is =B3wrapping=B2. =A0And since pvclock is us= ually the clocksource for
>>> dom0, and pvclock is driven! =A0by Xen=B9s =B3system time=B2, = a reasonable guess is
>>> that the timer that is wrapping is in Xen it= self.
>>>
>>> Mark=B9s delta =3D -2999660303788 ns
>>> Your delta =3D -299966033421= 1 ns
>>>
>>> Googling, I see the HPET wraparound is ~306 seconds and this d= elta is about
>>> 3000 seconds, so that may be a bad guess.
>>>
>>> Keir, any thoughts on this? =A0Do you recall any post-4.0 patc= hes that may
>>> have
>>> fixed this?
>> I've never seen a 3000s wrap, and I don't know of anything= that would have
>> fixed a bug like this. If this is a Xen time wrap of some kind the= n it would
>> affect all running guests; it's not clear here whether only on= e, or all,
>> guests see the wrap.
>>
>> =A0 K.
>>
>>> Thanks,
>>> Dan
>>>
>>> References:
>>> http://lists.xensource.com/archive= s/html/xen-devel/2010-10/msg00210.html
>>> https://lkml.org/lkml/2010/10/26/126
>>>
>>>
>>> From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com]
>>> Sent: Wednesday, February 23, 2011 3:50 AM
>>> To: xen-devel@= lists.xensource.co! =A0m; Xen Users
>>> Subject: [Xen-devel] Xen 4 TSC problems
>>>
>>>
>>> Hello
>>>
>>>
>>>
>>> I've got an issue about time keeping with Xen 4.0 (Debian = squeeze release).
>>>
>>>
>>>
>>> My problem is here (hopefully I amn't the only one, so the= re might be a bug
>>> somewhere) : http://bugs.debian.org/cgi-bin/bu= greport.cgi?bug=3D599161#50
>>>
>>> After some times, =A0I got this error : Clocksource tsc unstab= le (delta =3D
>>> -2999660334211 ns). It has happened on several servers.
>>>
>>>
>>>
>>> Looking at the output of "xm debug-key s;"
>>>
>>>
>>>
>>> (XEN) TSC has constant rate, deep Cstates possible, so not rel= iable,
>>> warp=3D2850
>>> (count=3D3)
>>>
>>>
>>>
>>> I am using a "Intel(R) Xeon(R) CPU L5420 =A0@ 2.50GHz&quo= t;, which has the
>>> "constant_tsc", but not the "nonstop_tsc" = one.
>>>
>>> On other systems with a newer cpu with "nonstop_tsc"= , I don't have this
>>> issue
>>> (systems are running the same distros with same config).
>>>
>>>
>>>
>>> I tried to boot with "max_cstate=3D0", but nothing c= hanged, my TSC isn't
>>> reliable and after some times, I will got the "50min"= ; issue again.
>>>
>>>
>>>
>>> I don't unders! =A0tand how a system can do a jump of &quo= t;50min" in the future.
>>> Why
>>> 50min ? it is not 40min, not 1 hour, it is always 50min.
>>>
>>> I don't know how to make my TSC "reliable" (I al= ready disable everything
>>> about
>>> Powerstate in BIOS Settings).
>>>
>>>
>>>
>>> Any ideas ?
>>>
>>>
>>>
>>> Regards
>>>
>>>
>>>
>>> Olivier
>>>
>>
>



--0015175caae4859f9a049d0443f3-- --===============1147676405== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel --===============1147676405==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: Xen 4 TSC problems Date: Thu, 24 Feb 2011 10:59:30 +0000 Message-ID: <4D6648220200007800033769@vpn.id2.novell.com> References: <4D655A39.2040501@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Jeremy Fitzhardinge , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 24.02.11 at 10:59, Olivier Hanesse = wrote: > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more = times. >=20 > Output of xm debug-key s : >=20 > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > warp=3D2684 (count=3D4) Did you try turning of use of C states ("cpuidle=3D0" on the Xen command line)? Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Thu, 24 Feb 2011 11:30:12 +0000 Message-ID: References: <4D6648220200007800033769@vpn.id2.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <4D6648220200007800033769@vpn.id2.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Jan Beulich , Olivier Hanesse Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Dan Magenheimer , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 24/02/2011 10:59, "Jan Beulich" wrote: >>>> On 24.02.11 at 10:59, Olivier Hanesse wrote: >> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. >> >> Output of xm debug-key s : >> >> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, >> warp=2684 (count=4) > > Did you try turning of use of C states ("cpuidle=0" on the Xen > command line)? Another thing to try is changing the platform timer that Xen uses. It's using HPET on your machines, so try clocksource=pit on Xen command line, and confirm that the 'Platform timer is xxx' message changes in xm dmesg. However, this bug looks more like a CPU's TSC jumping forward (or maybe backward) for some inexplicable reason. We added code post 3.2 to detect the platform timer counter wrapping, and to account for that based on trusting the CPU's 64-bit TSC. But if the TSC value is bogus then we can detect a wrap when it didn't happen and the new code will do more harm than good. It is not currently possible to disable the code via a boto parameter -- maybe we could add that. However, if the problem is a jumpy TSC then it is better to fix that as Xen relies so heavily on TSC for time handling. -- Keir > Jan > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Thu, 24 Feb 2011 12:57:15 +0100 Message-ID: References: <4D6648220200007800033769@vpn.id2.novell.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1556827642==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Keir Fraser Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Dan Magenheimer , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1556827642== Content-Type: multipart/alternative; boundary=0016e64caa7e21f5f6049d05ea9e --0016e64caa7e21f5f6049d05ea9e Content-Type: text/plain; charset=ISO-8859-1 Jan : I tried to turn off cstates with max_cstate=0 without success (still "not reliable"). With cpuidle=0, I also got : (XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp=3022 (count=1) xm info | grep command xen_commandline : dom0_mem=512M cpuidle=0 loglvl=all guest_loglvl=all dom0_max_vcpus=1 dom0_vcpus_pin console=vga,com1 com1=19200,8n1 Keir : Using clocksource=pit : (XEN) Platform timer is 1.193MHz PIT I also got : (XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp=3262 (count=2) 2011/2/24 Keir Fraser > On 24/02/2011 10:59, "Jan Beulich" wrote: > > >>>> On 24.02.11 at 10:59, Olivier Hanesse > wrote: > >> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more > times. > >> > >> Output of xm debug-key s : > >> > >> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > >> warp=2684 (count=4) > > > > Did you try turning of use of C states ("cpuidle=0" on the Xen > > command line)? > > Another thing to try is changing the platform timer that Xen uses. It's > using HPET on your machines, so try clocksource=pit on Xen command line, > and > confirm that the 'Platform timer is xxx' message changes in xm dmesg. > > However, this bug looks more like a CPU's TSC jumping forward (or maybe > backward) for some inexplicable reason. We added code post 3.2 to detect > the > platform timer counter wrapping, and to account for that based on trusting > the CPU's 64-bit TSC. But if the TSC value is bogus then we can detect a > wrap when it didn't happen and the new code will do more harm than good. It > is not currently possible to disable the code via a boto parameter -- maybe > we could add that. However, if the problem is a jumpy TSC then it is better > to fix that as Xen relies so heavily on TSC for time handling. > > -- Keir > > > Jan > > > > > --0016e64caa7e21f5f6049d05ea9e Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Jan :=A0

I tried to turn off cstates with max_cstate= =3D0=A0without success (still "not reliable").

With cpuidle=3D0, I also got :

(XEN) TSC has constant rate, deep Cstates possible, so = not reliable, warp=3D3022 (count=3D1)

xm info= | grep command
xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M = cpuidle=3D0 loglvl=3Dall guest_loglvl=3Dall dom0_max_vcpus=3D1 dom0_vcpus_p= in console=3Dvga,com1 com1=3D19200,8n1

Keir :=A0

Using clocksou= rce=3Dpit :

(XEN) Platform timer is 1.19= 3MHz PIT

I also got : =A0

(XEN) TSC has constant rate, deep Cstates possible, so not reliable, warp= =3D3262 (count=3D2)

2011/2/24 Keir Fraser <keir@xen.org>
On 24/02/= 2011 10:59, "Jan Beulich" <JBeulich@novell.com> wrote:

>>>> On 24.02.11 at 10:59, Olivier Hanesse <olivier.hanesse@gmail.com> wrote:
>> (XEN) Platform timer appears to have unexpectedly wrapped 10 or mo= re times.
>>
>> Output of xm debug-key s :
>>
>> (XEN) TSC has constant rate, deep Cstates possible, so not reliabl= e,
>> warp=3D2684 (count=3D4)
>
> Did you try turning of use of C states ("cpuidle=3D0" on the= Xen
> command line)?

Another thing to try is changing the platform timer that Xen us= es. It's
using HPET on your machines, so try clocksource=3Dpit on Xen command line, = and
confirm that the 'Platform timer is xxx' message changes in xm dmes= g.

However, this bug looks more like a CPU's TSC jumping forward (or maybe=
backward) for some inexplicable reason. We added code post 3.2 to detect th= e
platform timer counter wrapping, and to account for that based on trusting<= br> the CPU's 64-bit TSC. But if the TSC value is bogus then we can detect = a
wrap when it didn't happen and the new code will do more harm than good= . It
is not currently possible to disable the code via a boto parameter -- maybe=
we could add that. However, if the problem is a jumpy TSC then it is better=
to fix that as Xen relies so heavily on TSC for time handling.

=A0-- Keir

> Jan
>



--0016e64caa7e21f5f6049d05ea9e-- --===============1556827642== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users --===============1556827642==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: Xen 4 TSC problems Date: Thu, 24 Feb 2011 12:37:53 +0000 Message-ID: <4D665F3102000078000337B6@vpn.id2.novell.com> References: <4D6648220200007800033769@vpn.id2.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Olivier Hanesse Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 24.02.11 at 12:57, Olivier Hanesse = wrote: > I tried to turn off cstates with max_cstate=3D0 without success (still = "not > reliable"). >=20 > With cpuidle=3D0, I also got : >=20 > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > warp=3D3022 (count=3D1) This message by itself isn't telling much I believe. > xm info | grep command > xen_commandline : dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall = guest_loglvl=3Dall > dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 >=20 > Keir : >=20 > Using clocksource=3Dpit : >=20 > (XEN) Platform timer is 1.193MHz PIT >=20 > I also got : >=20 > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > warp=3D3262 (count=3D2) The question is whether any of this eliminates the time jumps seen by your DomU-s (from your past mails I wasn't actually sure whether Dom0 also experienced this problem, albeit it would be odd if it didn't). Jan Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: Xen 4 TSC problems Date: Thu, 24 Feb 2011 15:20:01 +0100 Message-ID: References: <4D6648220200007800033769@vpn.id2.novell.com> <4D665F3102000078000337B6@vpn.id2.novell.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1821090853==" Return-path: In-Reply-To: <4D665F3102000078000337B6@vpn.id2.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1821090853== Content-Type: multipart/alternative; boundary=0016367f985ebb86cd049d07e863 --0016367f985ebb86cd049d07e863 Content-Type: text/plain; charset=ISO-8859-1 Both dom0 and domUs are affected by this" jump". I expect to see something like "TSC marked as reliable, warp = 0". I got this on newer hardware with same config/distros. Is there a way to measure if it is a TSC warp ? to point out a cpu tsc issue ? 2011/2/24 Jan Beulich > >>> On 24.02.11 at 12:57, Olivier Hanesse > wrote: > > I tried to turn off cstates with max_cstate=0 without success (still "not > > reliable"). > > > > With cpuidle=0, I also got : > > > > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > > warp=3022 (count=1) > > This message by itself isn't telling much I believe. > > > xm info | grep command > > xen_commandline : dom0_mem=512M cpuidle=0 loglvl=all > guest_loglvl=all > > dom0_max_vcpus=1 dom0_vcpus_pin console=vga,com1 com1=19200,8n1 > > > > Keir : > > > > Using clocksource=pit : > > > > (XEN) Platform timer is 1.193MHz PIT > > > > I also got : > > > > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > > warp=3262 (count=2) > > The question is whether any of this eliminates the time jumps seen > by your DomU-s (from your past mails I wasn't actually sure whether > Dom0 also experienced this problem, albeit it would be odd if it didn't). > > Jan > > Jan > > --0016367f985ebb86cd049d07e863 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Both dom0 and domUs are affected by this" jump".

I e= xpect to see something like "TSC marked as reliable, warp =3D 0".= =A0
I got this on newer hardware with same config/distros.
<= div>
Is there a way to measure if it is a TSC warp ? to point out= a cpu tsc issue ?


2011/2/24 Jan Beulich <JBeulich@novell.com>
>>> On 24.02.11 = at 12:57, Olivier Hanesse <= olivier.hanesse@gmail.com> wrote:
> I tried to turn off cstates with max_cstate=3D0 without success (still= "not
> reliable").
>
> With cpuidle=3D0, I also got :
>
> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > warp=3D3022 (count=3D1)

This message by itself isn't telling much I believe.

> xm info | grep command
> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle=3D0 loglvl=3D= all guest_loglvl=3Dall
> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1<= br> >
> Keir :
>
> Using clocksource=3Dpit :
>
> (XEN) Platform timer is 1.193MHz PIT
>
> I also got :
>
> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > warp=3D3262 (count=3D2)

The question is whether any of this eliminates the time jumps seen by your DomU-s (from your past mails I wasn't actually sure whether
Dom0 also experienced this problem, albeit it would be odd if it didn't= ).

Jan

Jan


--0016367f985ebb86cd049d07e863-- --===============1821090853== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel --===============1821090853==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Thu, 24 Feb 2011 14:52:03 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Olivier Hanesse , Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 24/02/2011 14:20, "Olivier Hanesse" wrote: > Both dom0 and domUs are affected by this" jump". >=20 > I expect to see something like "TSC marked as reliable, warp =3D 0".=A0 > I got this on newer hardware with same config/distros. It depends on the CPU itself, older CPUs do not have the super-stable TSC features. But that should never cause a massive 3000s time jump. > Is there a way to measure if it is a TSC warp ? to point out a cpu tsc is= sue ? The TSC warps or out-of-sync issues that we could reasonably expect would b= e on the order of microseconds. A 3000s warp is something else entirely. Xen is very confused and/or some TSC or platform timer has jumped a long way (indicating a hardware/firmware issue). -- Keir >=20 > 2011/2/24 Jan Beulich >>>>> On 24.02.11 at 12:57, Olivier Hanesse wro= te: >>> I tried to turn off cstates with max_cstate=3D0 without success (still "n= ot >>> reliable"). >>>=20 >>> With cpuidle=3D0, I also got : >>>=20 >>> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, >>> warp=3D3022 (count=3D1) >>=20 >> This message by itself isn't telling much I believe. >>=20 >>> xm info | grep command >>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall guest_loglv= l=3Dall >>> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 >>>=20 >>> Keir : >>>=20 >>> Using clocksource=3Dpit : >>>=20 >>> (XEN) Platform timer is 1.193MHz PIT >>>=20 >>> I also got : >>>=20 >>> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, >>> warp=3D3262 (count=3D2) >>=20 >> The question is whether any of this eliminates the time jumps seen >> by your DomU-s (from your past mails I wasn't actually sure whether >> Dom0 also experienced this problem, albeit it would be odd if it didn't)= . >>=20 >> Jan >>=20 >> Jan >>=20 >=20 >=20 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: RE: [Xen-devel] Xen 4 TSC problems Date: Thu, 24 Feb 2011 09:43:22 -0800 (PST) Message-ID: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> References: Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Keir Fraser , Olivier Hanesse , Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org Just a wild guess, but this in Olivier's posted output: (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. and the fact that a 32-bit HPET wrap is ~300 seconds and, with the "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue (or a complete red herring, but I thought it worth mentioning). Mark and Olivier, it would be interesting to know if you are using the same processor/system. > -----Original Message----- > From: Keir Fraser [mailto:keir.xen@gmail.com] > Sent: Thursday, February 24, 2011 7:52 AM > To: Olivier Hanesse; Jan Beulich > Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen > Users; Dan Magenheimer; Keir Fraser > Subject: Re: [Xen-devel] Xen 4 TSC problems >=20 > On 24/02/2011 14:20, "Olivier Hanesse" > wrote: >=20 > > Both dom0 and domUs are affected by this" jump". > > > > I expect to see something like "TSC marked as reliable, warp =3D 0". > > I got this on newer hardware with same config/distros. >=20 > It depends on the CPU itself, older CPUs do not have the super-stable > TSC > features. But that should never cause a massive 3000s time jump. >=20 > > Is there a way to measure if it is a TSC warp ? to point out a cpu > tsc issue ? >=20 > The TSC warps or out-of-sync issues that we could reasonably expect > would be > on the order of microseconds. A 3000s warp is something else entirely. > Xen > is very confused and/or some TSC or platform timer has jumped a long > way > (indicating a hardware/firmware issue). >=20 > -- Keir >=20 > > > > 2011/2/24 Jan Beulich > >>>>> On 24.02.11 at 12:57, Olivier Hanesse > wrote: > >>> I tried to turn off cstates with max_cstate=3D0 without success > (still "not > >>> reliable"). > >>> > >>> With cpuidle=3D0, I also got : > >>> > >>> (XEN) TSC has constant rate, deep Cstates possible, so not > reliable, > >>> warp=3D3022 (count=3D1) > >> > >> This message by itself isn't telling much I believe. > >> > >>> xm info | grep command > >>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle=3D0 loglvl= =3Dall > guest_loglvl=3Dall > >>> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 > >>> > >>> Keir : > >>> > >>> Using clocksource=3Dpit : > >>> > >>> (XEN) Platform timer is 1.193MHz PIT > >>> > >>> I also got : > >>> > >>> (XEN) TSC has constant rate, deep Cstates possible, so not > reliable, > >>> warp=3D3262 (count=3D2) > >> > >> The question is whether any of this eliminates the time jumps seen > >> by your DomU-s (from your past mails I wasn't actually sure whether > >> Dom0 also experienced this problem, albeit it would be odd if it > didn't). > >> > >> Jan > >> > >> Jan > >> > > > > >=20 >=20 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: Xen 4 TSC problems Date: Thu, 24 Feb 2011 18:58:08 +0100 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1539987911==" Return-path: In-Reply-To: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Dan Magenheimer Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1539987911== Content-Type: multipart/alternative; boundary=0016367f985ec8d256049d0af408 --0016367f985ec8d256049d0af408 Content-Type: text/plain; charset=ISO-8859-1 Mark is running with a E5620 Xeon processor. I got a L5420. What is very strange is that this jump is always 50min, not more, not less. And we are not alone with Mark to have this issue. So it might have an explanation somewhere (bad counter, overflow, bug or somethings). So maybe this 300 seconds * 10 is a lead. Another point, what is the number "warp" really means, in the output of "xm debug-key -s". Should I monitor this number ? Maybe I could predict a jump by watching this value ? 2011/2/24 Dan Magenheimer > Just a wild guess, but this in Olivier's posted output: > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > (or a complete red herring, but I thought it worth mentioning). > > Mark and Olivier, it would be interesting to know if you are > using the same processor/system. > > > -----Original Message----- > > From: Keir Fraser [mailto:keir.xen@gmail.com] > > Sent: Thursday, February 24, 2011 7:52 AM > > To: Olivier Hanesse; Jan Beulich > > Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen > > Users; Dan Magenheimer; Keir Fraser > > Subject: Re: [Xen-devel] Xen 4 TSC problems > > > > On 24/02/2011 14:20, "Olivier Hanesse" > > wrote: > > > > > Both dom0 and domUs are affected by this" jump". > > > > > > I expect to see something like "TSC marked as reliable, warp = 0". > > > I got this on newer hardware with same config/distros. > > > > It depends on the CPU itself, older CPUs do not have the super-stable > > TSC > > features. But that should never cause a massive 3000s time jump. > > > > > Is there a way to measure if it is a TSC warp ? to point out a cpu > > tsc issue ? > > > > The TSC warps or out-of-sync issues that we could reasonably expect > > would be > > on the order of microseconds. A 3000s warp is something else entirely. > > Xen > > is very confused and/or some TSC or platform timer has jumped a long > > way > > (indicating a hardware/firmware issue). > > > > -- Keir > > > > > > > > 2011/2/24 Jan Beulich > > >>>>> On 24.02.11 at 12:57, Olivier Hanesse > > wrote: > > >>> I tried to turn off cstates with max_cstate=0 without success > > (still "not > > >>> reliable"). > > >>> > > >>> With cpuidle=0, I also got : > > >>> > > >>> (XEN) TSC has constant rate, deep Cstates possible, so not > > reliable, > > >>> warp=3022 (count=1) > > >> > > >> This message by itself isn't telling much I believe. > > >> > > >>> xm info | grep command > > >>> xen_commandline : dom0_mem=512M cpuidle=0 loglvl=all > > guest_loglvl=all > > >>> dom0_max_vcpus=1 dom0_vcpus_pin console=vga,com1 com1=19200,8n1 > > >>> > > >>> Keir : > > >>> > > >>> Using clocksource=pit : > > >>> > > >>> (XEN) Platform timer is 1.193MHz PIT > > >>> > > >>> I also got : > > >>> > > >>> (XEN) TSC has constant rate, deep Cstates possible, so not > > reliable, > > >>> warp=3262 (count=2) > > >> > > >> The question is whether any of this eliminates the time jumps seen > > >> by your DomU-s (from your past mails I wasn't actually sure whether > > >> Dom0 also experienced this problem, albeit it would be odd if it > > didn't). > > >> > > >> Jan > > >> > > >> Jan > > >> > > > > > > > > > > > --0016367f985ec8d256049d0af408 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Mark is running with a=A0E5620 X= eon processor.
I got a=A0L5420.

What is very strange is that this jump is alwa= ys 50min, not more, not less. And we are not alone with Mark to have this i= ssue.
So it might have an explanation somewhere (bad counter, overflow, bug = or somethings).
So maybe this 300 seconds * 10 is a lead.=A0

Another point, what is the number "warp" rea= lly means, in the output of "xm debug-key -s".
Should I monitor this number ? =A0Maybe I could predict a jump by watc= hing this value ?

2011/2/24 Dan Ma= genheimer <dan.magenheimer@oracle.com>
Just a wild guess, but this in Olivier'= s posted output:

(XEN) Platform timer appears to have unexpectedly wrapped 10 or more times.=

and the fact that a 32-bit HPET wrap is ~300 seconds and, with the "10 or more times", 10 * 300 seconds is 3000 seconds, might be a = clue
(or a complete red herring, but I thought it worth mentioning).

Mark and Olivier, it would be interesting to know if you are
using the same processor/system.

> -----Original Message-----
> From: Keir Fraser [mailto:keir.x= en@gmail.com]
> Sent: Thursday, February 24, 2011 7:52 AM
> To: Olivier Hanesse; Jan Beulich
> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen
> Users; Dan Magenheimer; Keir Fraser
> Subject: Re: [Xen-devel] Xen 4 TSC problems
>
> On 24/02/2011 14:20, "Olivier Hanesse" <olivier.hanesse@gmail.com>
> wrote:
>
> > Both dom0 and domUs are affected by this" jump".
> >
> > I expect to see something like "TSC marked as reliable, warp= =3D 0".
> > I got this on newer hardware with same config/distros.
>
> It depends on the CPU itself, older CPUs do not have the super-stable<= br> > TSC
> features. But that should never cause a massive 3000s time jump.
>
> > Is there a way to measure if it is a TSC warp ? to point out a cp= u
> tsc issue ?
>
> The TSC warps or out-of-sync issues that we could reasonably expect > would be
> on the order of microseconds. A 3000s warp is something else entirely.=
> Xen
> is very confused and/or some TSC or platform timer has jumped a long > way
> (indicating a hardware/firmware issue).
>
> =A0-- Keir
>
> >
> > 2011/2/24 Jan Beulich <= JBeulich@novell.com>
> >>>>> On 24.02.11 at 12:57, Olivier Hanesse <olivier.hanesse@gmail.com>
> wrote:
> >>> I tried to turn off cstates with max_cstate=3D0 without s= uccess
> (still "not
> >>> reliable").
> >>>
> >>> With cpuidle=3D0, I also got :
> >>>
> >>> (XEN) TSC has constant rate, deep Cstates possible, so no= t
> reliable,
> >>> warp=3D3022 (count=3D1)
> >>
> >> This message by itself isn't telling much I believe.
> >>
> >>> xm info | grep command
> >>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle= =3D0 loglvl=3Dall
> guest_loglvl=3Dall
> >>> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1= =3D19200,8n1
> >>>
> >>> Keir :
> >>>
> >>> Using clocksource=3Dpit :
> >>>
> >>> (XEN) Platform timer is 1.193MHz PIT
> >>>
> >>> I also got :
> >>>
> >>> (XEN) TSC has constant rate, deep Cstates possible, so no= t
> reliable,
> >>> warp=3D3262 (count=3D2)
> >>
> >> The question is whether any of this eliminates the time jumps= seen
> >> by your DomU-s (from your past mails I wasn't actually su= re whether
> >> Dom0 also experienced this problem, albeit it would be odd if= it
> didn't).
> >>
> >> Jan
> >>
> >> Jan
> >>
> >
> >
>
>

--0016367f985ec8d256049d0af408-- --===============1539987911== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel --===============1539987911==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeremy Fitzhardinge Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Thu, 24 Feb 2011 11:01:58 -0800 Message-ID: <4D66AB26.9020000@goop.org> References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Dan Magenheimer Cc: xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 02/24/2011 09:43 AM, Dan Magenheimer wrote: > Just a wild guess, but this in Olivier's posted output: > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > (or a complete red herring, but I thought it worth mentioning). > > Mark and Olivier, it would be interesting to know if you are > using the same processor/system. It definitely seems like some kind of problem on the host system rather than anything in the guests themselves. If the platform timer is misbehaving, then Xen could be completely screwing up the pvclock calibration which it then passes to guests. Could it be one of those "platform clock stops in certain power states" problems? J >> -----Original Message----- >> From: Keir Fraser [mailto:keir.xen@gmail.com] >> Sent: Thursday, February 24, 2011 7:52 AM >> To: Olivier Hanesse; Jan Beulich >> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen >> Users; Dan Magenheimer; Keir Fraser >> Subject: Re: [Xen-devel] Xen 4 TSC problems >> >> On 24/02/2011 14:20, "Olivier Hanesse" >> wrote: >> >>> Both dom0 and domUs are affected by this" jump". >>> >>> I expect to see something like "TSC marked as reliable, warp = 0". >>> I got this on newer hardware with same config/distros. >> It depends on the CPU itself, older CPUs do not have the super-stable >> TSC >> features. But that should never cause a massive 3000s time jump. >> >>> Is there a way to measure if it is a TSC warp ? to point out a cpu >> tsc issue ? >> >> The TSC warps or out-of-sync issues that we could reasonably expect >> would be >> on the order of microseconds. A 3000s warp is something else entirely. >> Xen >> is very confused and/or some TSC or platform timer has jumped a long >> way >> (indicating a hardware/firmware issue). >> >> -- Keir >> >>> 2011/2/24 Jan Beulich >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse >> wrote: >>>>> I tried to turn off cstates with max_cstate=0 without success >> (still "not >>>>> reliable"). >>>>> >>>>> With cpuidle=0, I also got : >>>>> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >> reliable, >>>>> warp=3022 (count=1) >>>> This message by itself isn't telling much I believe. >>>> >>>>> xm info | grep command >>>>> xen_commandline : dom0_mem=512M cpuidle=0 loglvl=all >> guest_loglvl=all >>>>> dom0_max_vcpus=1 dom0_vcpus_pin console=vga,com1 com1=19200,8n1 >>>>> >>>>> Keir : >>>>> >>>>> Using clocksource=pit : >>>>> >>>>> (XEN) Platform timer is 1.193MHz PIT >>>>> >>>>> I also got : >>>>> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >> reliable, >>>>> warp=3262 (count=2) >>>> The question is whether any of this eliminates the time jumps seen >>>> by your DomU-s (from your past mails I wasn't actually sure whether >>>> Dom0 also experienced this problem, albeit it would be odd if it >> didn't). >>>> Jan >>>> >>>> Jan >>>> >>> >> From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Mon, 28 Feb 2011 15:37:02 +0100 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <4D66AB26.9020000@goop.org> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0454445095==" Return-path: In-Reply-To: <4D66AB26.9020000@goop.org> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Jeremy Fitzhardinge Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============0454445095== Content-Type: multipart/alternative; boundary=0016e64caa7ef37658049d589c98 --0016e64caa7ef37658049d589c98 Content-Type: text/plain; charset=ISO-8859-1 Hello, It happened again twice this weekend. What about setting "tsc_mode=2" for my vms ? Should this mode prevent this bug (coming from a bad emulated tsc due to firmware issue ? is it possible ?) from affecting time in domUs ? Setting clocksource=pit, make 'tsc' available in "/sys/devices/system/clocksource/clocksource0/available_clocksource" (otherwise only xen is available, is it normal ? ). Should I bypass xen clocksource and use tsc as a clocksource for dom0/domU ? or will it be worsed ? Regards Olivier 2011/2/24 Jeremy Fitzhardinge > On 02/24/2011 09:43 AM, Dan Magenheimer wrote: > > Just a wild guess, but this in Olivier's posted output: > > > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more > times. > > > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > > (or a complete red herring, but I thought it worth mentioning). > > > > Mark and Olivier, it would be interesting to know if you are > > using the same processor/system. > > It definitely seems like some kind of problem on the host system rather > than anything in the guests themselves. If the platform timer is > misbehaving, then Xen could be completely screwing up the pvclock > calibration which it then passes to guests. > > Could it be one of those "platform clock stops in certain power states" > problems? > > J > > >> -----Original Message----- > >> From: Keir Fraser [mailto:keir.xen@gmail.com] > >> Sent: Thursday, February 24, 2011 7:52 AM > >> To: Olivier Hanesse; Jan Beulich > >> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen > >> Users; Dan Magenheimer; Keir Fraser > >> Subject: Re: [Xen-devel] Xen 4 TSC problems > >> > >> On 24/02/2011 14:20, "Olivier Hanesse" > >> wrote: > >> > >>> Both dom0 and domUs are affected by this" jump". > >>> > >>> I expect to see something like "TSC marked as reliable, warp = 0". > >>> I got this on newer hardware with same config/distros. > >> It depends on the CPU itself, older CPUs do not have the super-stable > >> TSC > >> features. But that should never cause a massive 3000s time jump. > >> > >>> Is there a way to measure if it is a TSC warp ? to point out a cpu > >> tsc issue ? > >> > >> The TSC warps or out-of-sync issues that we could reasonably expect > >> would be > >> on the order of microseconds. A 3000s warp is something else entirely. > >> Xen > >> is very confused and/or some TSC or platform timer has jumped a long > >> way > >> (indicating a hardware/firmware issue). > >> > >> -- Keir > >> > >>> 2011/2/24 Jan Beulich > >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse > >> wrote: > >>>>> I tried to turn off cstates with max_cstate=0 without success > >> (still "not > >>>>> reliable"). > >>>>> > >>>>> With cpuidle=0, I also got : > >>>>> > >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not > >> reliable, > >>>>> warp=3022 (count=1) > >>>> This message by itself isn't telling much I believe. > >>>> > >>>>> xm info | grep command > >>>>> xen_commandline : dom0_mem=512M cpuidle=0 loglvl=all > >> guest_loglvl=all > >>>>> dom0_max_vcpus=1 dom0_vcpus_pin console=vga,com1 com1=19200,8n1 > >>>>> > >>>>> Keir : > >>>>> > >>>>> Using clocksource=pit : > >>>>> > >>>>> (XEN) Platform timer is 1.193MHz PIT > >>>>> > >>>>> I also got : > >>>>> > >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not > >> reliable, > >>>>> warp=3262 (count=2) > >>>> The question is whether any of this eliminates the time jumps seen > >>>> by your DomU-s (from your past mails I wasn't actually sure whether > >>>> Dom0 also experienced this problem, albeit it would be odd if it > >> didn't). > >>>> Jan > >>>> > >>>> Jan > >>>> > >>> > >> > > --0016e64caa7ef37658049d589c98 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
Hello,

It happened again twice this weekend.
<= br>
What about setting "tsc_mode=3D2" for my vms ? Shou= ld this mode prevent this bug (coming from a bad emulated tsc due to firmwa= re issue ? is it possible ?) from affecting time in domUs ?

Setting clocksource=3Dpit, make 'tsc' available= in "/sys/devices/system/clocksource/clocksource0/available_clocksourc= e" (otherwise only xen is available, is it normal ? ).=A0
Should I bypass xen clocksource and use tsc as a clocksource for= dom0/domU ? or =A0will it be worsed ?

Regards

Olivier

2011= /2/24 Jeremy Fitzhardinge <jeremy@goop.org>
On 02/24/2011 09:43 AM, D= an Magenheimer wrote:
> Just a wild guess, but this in Olivier's posted output:
>
> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more t= imes.
>
> and the fact that a 32-bit HPET wrap is ~300 seconds and, with the
> "10 or more times", 10 * 300 seconds is 3000 seconds, might = be a clue
> (or a complete red herring, but I thought it worth mentioning).
>
> Mark and Olivier, it would be interesting to know if you are
> using the same processor/system.

It definitely seems like some kind of problem on the host system rath= er
than anything in the guests themselves. =A0If the platform timer is
misbehaving, then Xen could be completely screwing up the pvclock
calibration which it then passes to guests.

Could it be one of those "platform clock stops in certain power states= "
problems?

=A0 =A0J

>> -----Original Message-----
>> From: Keir Fraser [mailto:ke= ir.xen@gmail.com]
>> Sent: Thursday, February 24, 2011 7:52 AM
>> To: Olivier Hanesse; Jan Beulich
>> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen
>> Users; Dan Magenheimer; Keir Fraser
>> Subject: Re: [Xen-devel] Xen 4 TSC problems
>>
>> On 24/02/2011 14:20, "Olivier Hanesse" <olivier.hanesse@gmail.com>
>> wrote:
>>
>>> Both dom0 and domUs are affected by this" jump".
>>>
>>> I expect to see something like "TSC marked as reliable, w= arp =3D 0".
>>> I got this on newer hardware with same config/distros.
>> It depends on the CPU itself, older CPUs do not have the super-sta= ble
>> TSC
>> features. But that should never cause a massive 3000s time jump. >>
>>> Is there a way to measure if it is a TSC warp ? to point out a= cpu
>> tsc issue ?
>>
>> The TSC warps or out-of-sync issues that we could reasonably expec= t
>> would be
>> on the order of microseconds. A 3000s warp is something else entir= ely.
>> Xen
>> is very confused and/or some TSC or platform timer has jumped a lo= ng
>> way
>> (indicating a hardware/firmware issue).
>>
>> =A0-- Keir
>>
>>> 2011/2/24 Jan Beulich <JBeulich@novell.com>
>>>>>>> On 24.02.11 at 12:57, Olivier Hanesse <olivier.hanesse@gmail.com><= br> >> wrote:
>>>>> I tried to turn off cstates with max_cstate=3D0 withou= t success
>> (still "not
>>>>> reliable").
>>>>>
>>>>> With cpuidle=3D0, I also got :
>>>>>
>>>>> (XEN) TSC has constant rate, deep Cstates possible, so= not
>> reliable,
>>>>> warp=3D3022 (count=3D1)
>>>> This message by itself isn't telling much I believe. >>>>
>>>>> xm info | grep command
>>>>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuid= le=3D0 loglvl=3Dall
>> guest_loglvl=3Dall
>>>>> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 c= om1=3D19200,8n1
>>>>>
>>>>> Keir :
>>>>>
>>>>> Using clocksource=3Dpit :
>>>>>
>>>>> (XEN) Platform timer is 1.193MHz PIT
>>>>>
>>>>> I also got :
>>>>>
>>>>> (XEN) TSC has constant rate, deep Cstates possible, so= not
>> reliable,
>>>>> warp=3D3262 (count=3D2)
>>>> The question is whether any of this eliminates the time ju= mps seen
>>>> by your DomU-s (from your past mails I wasn't actually= sure whether
>>>> Dom0 also experienced this problem, albeit it would be odd= if it
>> didn't).
>>>> Jan
>>>>
>>>> Jan
>>>>
>>>
>>


--0016e64caa7ef37658049d589c98-- --===============0454445095== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users --===============0454445095==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: Xen 4 TSC problems Date: Mon, 28 Feb 2011 15:00:47 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Olivier Hanesse , Jeremy Fitzhardinge Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org The message about detecting wrapped platform timer on Xen console indicates a host problem rather than a guest configuration problem. Did you try running long term with changed platform timer source on Xen command line (clocksource=3Dpit), and also cpuidle=3D0? K. On 28/02/2011 14:37, "Olivier Hanesse" wrote: > Hello, >=20 > It happened again twice this weekend. >=20 > What about setting "tsc_mode=3D2" for my vms ? Should this mode prevent thi= s bug > (coming from a bad emulated tsc due to firmware issue ? is it possible ?)= from > affecting time in domUs ? >=20 > Setting clocksource=3Dpit, make 'tsc' available in > "/sys/devices/system/clocksource/clocksource0/available_clocksource" > (otherwise only xen is available, is it normal ? ).=A0 >=20 > Should I bypass xen clocksource and use tsc as a clocksource for dom0/dom= U ? > or =A0will it be worsed ? >=20 > Regards >=20 > Olivier >=20 > 2011/2/24 Jeremy Fitzhardinge >> On 02/24/2011 09:43 AM, Dan Magenheimer wrote: >>> Just a wild guess, but this in Olivier's posted output: >>>=20 >>> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more ti= mes. >>>=20 >>> and the fact that a 32-bit HPET wrap is ~300 seconds and, with the >>> "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue >>> (or a complete red herring, but I thought it worth mentioning). >>>=20 >>> Mark and Olivier, it would be interesting to know if you are >>> using the same processor/system. >>=20 >> It definitely seems like some kind of problem on the host system rather >> than anything in the guests themselves. =A0If the platform timer is >> misbehaving, then Xen could be completely screwing up the pvclock >> calibration which it then passes to guests. >>=20 >> Could it be one of those "platform clock stops in certain power states" >> problems? >>=20 >> =A0 =A0J >>=20 >>>> -----Original Message----- >>>> From: Keir Fraser [mailto:keir.xen@gmail.com] >>>> Sent: Thursday, February 24, 2011 7:52 AM >>>> To: Olivier Hanesse; Jan Beulich >>>> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xe= n >>>> Users; Dan Magenheimer; Keir Fraser >>>> Subject: Re: [Xen-devel] Xen 4 TSC problems >>>>=20 >>>> On 24/02/2011 14:20, "Olivier Hanesse" >>>> wrote: >>>>=20 >>>>> Both dom0 and domUs are affected by this" jump". >>>>>=20 >>>>> I expect to see something like "TSC marked as reliable, warp =3D 0". >>>>> I got this on newer hardware with same config/distros. >>>> It depends on the CPU itself, older CPUs do not have the super-stable >>>> TSC >>>> features. But that should never cause a massive 3000s time jump. >>>>=20 >>>>> Is there a way to measure if it is a TSC warp ? to point out a cpu >>>> tsc issue ? >>>>=20 >>>> The TSC warps or out-of-sync issues that we could reasonably expect >>>> would be >>>> on the order of microseconds. A 3000s warp is something else entirely. >>>> Xen >>>> is very confused and/or some TSC or platform timer has jumped a long >>>> way >>>> (indicating a hardware/firmware issue). >>>>=20 >>>> =A0-- Keir >>>>=20 >>>>> 2011/2/24 Jan Beulich >>>>>>>>> On 24.02.11 at 12:57, Olivier Hanesse >>>> wrote: >>>>>>> I tried to turn off cstates with max_cstate=3D0 without success >>>> (still "not >>>>>>> reliable"). >>>>>>>=20 >>>>>>> With cpuidle=3D0, I also got : >>>>>>>=20 >>>>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >>>> reliable, >>>>>>> warp=3D3022 (count=3D1) >>>>>> This message by itself isn't telling much I believe. >>>>>>=20 >>>>>>> xm info | grep command >>>>>>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall >>>> guest_loglvl=3Dall >>>>>>> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 >>>>>>>=20 >>>>>>> Keir : >>>>>>>=20 >>>>>>> Using clocksource=3Dpit : >>>>>>>=20 >>>>>>> (XEN) Platform timer is 1.193MHz PIT >>>>>>>=20 >>>>>>> I also got : >>>>>>>=20 >>>>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >>>> reliable, >>>>>>> warp=3D3262 (count=3D2) >>>>>> The question is whether any of this eliminates the time jumps seen >>>>>> by your DomU-s (from your past mails I wasn't actually sure whether >>>>>> Dom0 also experienced this problem, albeit it would be odd if it >>>> didn't). >>>>>> Jan >>>>>>=20 >>>>>> Jan >>>>>>=20 >>>>>=20 >>>>=20 >>=20 >=20 >=20 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: RE: [Xen-devel] Xen 4 TSC problems Date: Mon, 28 Feb 2011 07:14:23 -0800 (PST) Message-ID: <49766a9d-0e37-46d2-9497-33c2e981a871@default> References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <4D66AB26.9020000@goop.org AANLkTi=NFebVsnj_09+SpCZUo_bNTnTyJLAoR5sfbwZa@mail.gmail.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1911446809==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Olivier Hanesse , Jeremy Fitzhardinge Cc: xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Mark, Keir Fraser , Xen Users , Adams List-Id: xen-devel@lists.xenproject.org --===============1911446809== Content-Type: multipart/alternative; boundary="__129890608367518096abhmt020" --__129890608367518096abhmt020 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Hi Olivier - =20 It is the Xen clocksource that you want to try to change, not the dom0 cloc= ksource. To do this, you need to specify "clocksource=3Dpit" on the Xen bo= ot line (and reboot), not the dom0 boot line. =20 I believe Mark Adams played with tsc_mode to see if it solved his (similar?= identical?) problem last year, and it didn't make any difference. Please try booting Xen with "clocksource=3Dpit" and ensure that "Platform t= imer is 1.19MHz PIT" appears in the Xen boot messages. If the 50min jump d= oes not appear again, it would point to a problem in the hpet, either hardw= are or software. =20 Thanks, Dan =20 From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com]=20 Sent: Monday, February 28, 2011 7:37 AM To: Jeremy Fitzhardinge Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; xen-devel@lists.= xensource.com; Xen Users; Keir Fraser Subject: Re: [Xen-devel] Xen 4 TSC problems =20 Hello, =20 It happened again twice this weekend. =20 What about setting "tsc_mode=3D2" for my vms ? Should this mode prevent thi= s bug (coming from a bad emulated tsc due to firmware issue ? is it possibl= e ?) from affecting time in domUs ? =20 Setting clocksource=3Dpit, make 'tsc' available in "/sys/devices/system/clo= cksource/clocksource0/available_clocksource" (otherwise only xen is availab= le, is it normal ? ).=20 =20 Should I bypass xen clocksource and use tsc as a clocksource for dom0/domU = ? or will it be worsed ? =20 Regards =20 Olivier =20 2011/2/24 Jeremy Fitzhardinge On 02/24/2011 09:43 AM, Dan Magenheimer wrote: > Just a wild guess, but this in Olivier's posted output: > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more time= s. > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > (or a complete red herring, but I thought it worth mentioning). > > Mark and Olivier, it would be interesting to know if you are > using the same processor/system. It definitely seems like some kind of problem on the host system rather than anything in the guests themselves. If the platform timer is misbehaving, then Xen could be completely screwing up the pvclock calibration which it then passes to guests. Could it be one of those "platform clock stops in certain power states" problems? J >> -----Original Message----- >> From: Keir Fraser [mailto:HYPERLINK "mailto:keir.xen@gmail.com"keir.xen@= gmail.com] >> Sent: Thursday, February 24, 2011 7:52 AM >> To: Olivier Hanesse; Jan Beulich >> Cc: Mark Adams; Jeremy Fitzhardinge; HYPERLINK "mailto:xen-devel@lists.x= ensource.com"xen-devel@lists.xensource.com; Xen >> Users; Dan Magenheimer; Keir Fraser >> Subject: Re: [Xen-devel] Xen 4 TSC problems >> >> On 24/02/2011 14:20, "Olivier Hanesse" >> wrote: >> >>> Both dom0 and domUs are affected by this" jump". >>> >>> I expect to see something like "TSC marked as reliable, warp =3D 0". >>> I got this on newer hardware with same config/distros. >> It depends on the CPU itself, older CPUs do not have the super-stable >> TSC >> features. But that should never cause a massive 3000s time jump. >> >>> Is there a way to measure if it is a TSC warp ? to point out a cpu >> tsc issue ? >> >> The TSC warps or out-of-sync issues that we could reasonably expect >> would be >> on the order of microseconds. A 3000s warp is something else entirely. >> Xen >> is very confused and/or some TSC or platform timer has jumped a long >> way >> (indicating a hardware/firmware issue). >> >> -- Keir >> >>> 2011/2/24 Jan Beulich >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse >> wrote: >>>>> I tried to turn off cstates with max_cstate=3D0 without success >> (still "not >>>>> reliable"). >>>>> >>>>> With cpuidle=3D0, I also got : >>>>> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >> reliable, >>>>> warp=3D3022 (count=3D1) >>>> This message by itself isn't telling much I believe. >>>> >>>>> xm info | grep command >>>>> xen_commandline : dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall >> guest_loglvl=3Dall >>>>> dom0_max_vcpus=3D1 dom0_vcpus_pin console=3Dvga,com1 com1=3D19200,8n1 >>>>> >>>>> Keir : >>>>> >>>>> Using clocksource=3Dpit : >>>>> >>>>> (XEN) Platform timer is 1.193MHz PIT >>>>> >>>>> I also got : >>>>> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >> reliable, >>>>> warp=3D3262 (count=3D2) >>>> The question is whether any of this eliminates the time jumps seen >>>> by your DomU-s (from your past mails I wasn't actually sure whether >>>> Dom0 also experienced this problem, albeit it would be odd if it >> didn't). >>>> Jan >>>> >>>> Jan >>>> >>> >> =20 --__129890608367518096abhmt020 Content-Type: text/html; charset=us-ascii Content-Transfer-Encoding: quoted-printable

Hi Ol= ivier –

&nb= sp;

It is the Xen clocksour= ce that you want to try to change, not the dom0 clocksource.  To do = this, you need to specify “clocksource=3Dpit” on the Xen boot= line (and reboot), not the dom0 boot line.

 

I believe Mark Adams played with tsc_mode to see if it solved! =20 his (similar? identical?) problem last year, and it didn’t make any dif= ference.


Please tr= y booting Xen with “clocksource=3Dpit” and ensure that “= ;Platform timer is 1.19MHz PIT” appears in the Xen boot messages.&n= bsp; If the 50min jump does not appear again, it would point to a problem= in the hpet, either hardware or software.

 

Thanks,

Dan

 

From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com]
Sen= t: Monday, February 28, 2011 7:37 AM
To: Jeremy Fitzharding= e
Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; xe= n-devel@lists.xensource.com; Xen Users; Keir Fraser
Subject: Re= : [Xen-devel] Xen 4 TSC problems

 

Hello,<= /o:p>

 

It happened again twice this weekend.

 

What about setting "tsc_mode=3D2" = for my vms ? Should this mode prevent this bug (coming from a bad emulate= d tsc due to firmware issue ? is it possible ?) from affecting time in do= mUs ?

 

=

Setting clocksource=3Dpit, make 'tsc' ava= ilable in "/sys/devices/system/clocksource/clocksource0/available_cl= ocksource" (otherwise only xen is available, is it normal ? ). =

 

Should I bypass xen clocksource and use tsc as= a clocksource for dom0/domU ? or  will it be worsed ?

 

Regards

&= nbsp;

Olivier

<= o:p> 

2011/2/24 Jeremy Fitzhardin= ge <jeremy@goop.org>

On 02/24/= 2011 09:43 AM, Dan Magenheimer wrote:
> Just a wild guess, but this= in Olivier's posted output:
>
> (XEN) Platform timer appears= to have unexpectedly wrapped 10 or more times.
>
> and the f= act that a 32-bit HPET wrap is ~300 seconds and, with the
> "1= 0 or more times", 10 * 300 seconds is 3000 seconds, might be a clue<= br>> (or a complete red herring, but I thought it worth mentioning).>
> Mark and Olivier, it would be interesting to know if you a= re
> using the same processor/system.

It definitely seems like some kind of problem on the host syste= m rather
than anything in the guests themselves. ! =20  If the platform timer is
misbehaving, then Xen could be completely screwi= ng up the pvclock
calibration which it then passes to guests.

C= ould it be one of those "platform clock stops in certain power state= s"
problems?


   J

>> -----Original M= essage-----
>> From: Keir Fraser [mailto:keir.xen@gmail.com]
>> Sent: Thursday, Februa= ry 24, 2011 7:52 AM
>> To: Olivier Hanesse; Jan Beulich
>&= gt; Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen
>> Users= ; Dan Magenheimer; Keir Fraser
>> Subject: Re: [Xen-devel] Xen 4= TSC problems
>>
>> On 24/02/2011 14:20, "Olivier = Hanesse" <olivier.ha= nesse@gmail.com>
>> wrote:
>>
>>> Both dom0 and domUs are affected by th= is" jump".
>>>
>>> I expect to see some= thing like "TSC marked as reliable, warp =3D 0".
>>>= ; I got this on newer hardware with same config/distros.
>> It d= epends on the CPU itself, older CPUs do not have the super-stable
>= > TSC
>> features. But that should never cause a massive 3000= s time jump.
>>
>>> Is there a way to measure if it = is a TSC warp ? to point out a cpu
>> tsc issue ?
>>>> The TSC warps or out-of-sync issues that we could reasonably ex= pect
>> would be
>> on the order of microseconds. A 300= 0s warp is something else entirely.
>> Xen
>> is very c= onfused and/or some TSC or platform timer has jumped a long
>> w= ay
>> (indicating a hardware/firmware issue).
>>
>= ;>  -- Keir
>>
>>! =20 ;> 2011/2/24 Jan Beulich <JBeulich= @novell.com>
>>>>>>> On 24.02.11 at 12:57,= Olivier Hanesse <olivier= .hanesse@gmail.com>
>> wrote:
>>>>> I t= ried to turn off cstates with max_cstate=3D0 without success
>> = (still "not
>>>>> reliable").
>>>= >>
>>>>> With cpuidle=3D0, I also got :
>&g= t;>>>
>>>>> (XEN) TSC has constant rate, deep = Cstates possible, so not
>> reliable,
>>>>> wa= rp=3D3022 (count=3D1)
>>>> This message by itself isn't te= lling much I believe.
>>>>
>>>>> xm info= | grep command
>>>>> xen_commandline     &nb= sp;  : dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall
>> guest_lo= glvl=3Dall
>>>>> dom0_max_vcpus=3D1 dom0_vcp! =20 us_pin console=3Dvga,com1 com1=3D19200,8n1
>>>>>
>>&g= t;>> Keir :
>>>>>
>>>>> Using c= locksource=3Dpit :
>>>>>
>>>>> (XEN) = Platform timer is 1.193MHz PIT
>>>>>
>>>>= ;> I also got :
>>>>>
>>>>> (XEN) = TSC has constant rate, deep Cstates possible, so not
>> reliable= ,
>>>>> warp=3D3262 (count=3D2)
>>>> The= question is whether any of this eliminates the time jumps seen
>&g= t;>> by your DomU-s (from your past mails I wasn't actually sure wh= ether
>>>> Dom0 also experienced this problem, albeit it w= ould be odd if it
>> didn't).
>>>> Jan
>>= ;>>
>>>> Jan
>>>>
>>>
= >>

 =

--__129890608367518096abhmt020-- --===============1911446809== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users --===============1911446809==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Mon, 28 Feb 2011 16:23:07 +0100 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <49766a9d-0e37-46d2-9497-33c2e981a871@default> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1988755835==" Return-path: In-Reply-To: <49766a9d-0e37-46d2-9497-33c2e981a871@default> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Dan Magenheimer Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1988755835== Content-Type: multipart/alternative; boundary=0015175cb0c6bbce5f049d59411e --0015175cb0c6bbce5f049d59411e Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Keir : Yes, it is "under progress". To make this change, I had to reboot every server, so it is taking time (production server :() So i was hoping to find a quick method to mitigate this issue on domUs whil= e rebooting servers. As this bug happens once or twice per server since October, I can't say tha= t right now that changing platform timer to PIT fixed it. I have to wait (I hope forever!) this bug to happen again on a 'patched' server ... But even with clcoksource=3Dpit, I am seeing some warp=3D3000+ in debug mes= sage ? I guess it is not a good sign, is it ? Jan : I was hoping to find a way to make the domU clocksource more "independent" like with xen3.2. 2011/2/28 Dan Magenheimer > Hi Olivier =96 > > > > It is the Xen clocksource that you want to try to change, not the dom0 > clocksource. To do this, you need to specify =93clocksource=3Dpit=94 on = the Xen > boot line (and reboot), not the dom0 boot line. > > > > I believe Mark Adams played with tsc_mode to see if it solved! his > (similar? identical?) problem last year, and it didn=92t make any differe= nce. > > > Please try booting Xen with =93clocksource=3Dpit=94 and ensure that =93Pl= atform > timer is 1.19MHz PIT=94 appears in the Xen boot messages. If the 50min j= ump > does not appear again, it would point to a problem in the hpet, either > hardware or software. > > > > Thanks, > > Dan > > > > *From:* Olivier Hanesse [mailto:olivier.hanesse@gmail.com] > *Sent:* Monday, February 28, 2011 7:37 AM > *To:* Jeremy Fitzhardinge > *Cc:* Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; > xen-devel@lists.xensource.com; Xen Users; Keir Fraser > > *Subject:* Re: [Xen-devel] Xen 4 TSC problems > > > > Hello, > > > > It happened again twice this weekend. > > > > What about setting "tsc_mode=3D2" for my vms ? Should this mode prevent t= his > bug (coming from a bad emulated tsc due to firmware issue ? is it possibl= e > ?) from affecting time in domUs ? > > > > Setting clocksource=3Dpit, make 'tsc' available in > "/sys/devices/system/clocksource/clocksource0/available_clocksource" > (otherwise only xen is available, is it normal ? ). > > > > Should I bypass xen clocksource and use tsc as a clocksource for dom0/dom= U > ? or will it be worsed ? > > > > Regards > > > > Olivier > > > > 2011/2/24 Jeremy Fitzhardinge > > On 02/24/2011 09:43 AM, Dan Magenheimer wrote: > > Just a wild guess, but this in Olivier's posted output: > > > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more > times. > > > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > > (or a complete red herring, but I thought it worth mentioning). > > > > Mark and Olivier, it would be interesting to know if you are > > using the same processor/system. > > It definitely seems like some kind of problem on the host system rather > than anything in the guests themselves. ! If the platform timer is > misbehaving, then Xen could be completely screwing up the pvclock > calibration which it then passes to guests. > > Could it be one of those "platform clock stops in certain power states" > problems? > > > J > > >> -----Original Message----- > >> From: Keir Fraser [mailto:keir.xen@gmail.com] > >> Sent: Thursday, February 24, 2011 7:52 AM > >> To: Olivier Hanesse; Jan Beulich > >> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xe= n > >> Users; Dan Magenheimer; Keir Fraser > >> Subject: Re: [Xen-devel] Xen 4 TSC problems > >> > >> On 24/02/2011 14:20, "Olivier Hanesse" > >> wrote: > >> > >>> Both dom0 and domUs are affected by this" jump". > >>> > >>> I expect to see something like "TSC marked as reliable, warp =3D 0". > >>> I got this on newer hardware with same config/distros. > >> It depends on the CPU itself, older CPUs do not have the super-stable > >> TSC > >> features. But that should never cause a massive 3000s time jump. > >> > >>> Is there a way to measure if it is a TSC warp ? to point out a cpu > >> tsc issue ? > >> > >> The TSC warps or out-of-sync issues that we could reasonably expect > >> would be > >> on the order of microseconds. A 3000s warp is something else entirely. > >> Xen > >> is very confused and/or some TSC or platform timer has jumped a long > >> way > >> (indicating a hardware/firmware issue). > >> > >> -- Keir > >> > >>! ;> 2011/2/24 Jan Beulich > > >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse > >> wrote: > >>>>> I tried to turn off cstates with max_cstate=3D0 without success > >> (still "not > >>>>> reliable"). > >>>>> > >>>>> With cpuidle=3D0, I also got : > >>>>> > >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not > >> reliable, > >>>>> warp=3D3022 (count=3D1) > >>>> This message by itself isn't telling much I believe. > >>>> > >>>>> xm info | grep command > >>>>> xen_commandline : dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall > >> guest_loglvl=3Dall > >>>>> dom0_max_vcpus=3D1 dom0_vcp! us_pin console=3Dvga,com1 com1=3D19200= ,8n1 > > >>>>> > >>>>> Keir : > >>>>> > >>>>> Using clocksource=3Dpit : > >>>>> > >>>>> (XEN) Platform timer is 1.193MHz PIT > >>>>> > >>>>> I also got : > >>>>> > >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not > >> reliable, > >>>>> warp=3D3262 (count=3D2) > >>>> The question is whether any of this eliminates the time jumps seen > >>>> by your DomU-s (from your past mails I wasn't actually sure whether > >>>> Dom0 also experienced this problem, albeit it would be odd if it > >> didn't). > >>>> Jan > >>>> > >>>> Jan > >>>> > >>> > >> > > > --0015175cb0c6bbce5f049d59411e Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Keir :=A0

Yes, it is "under progress"= .=A0
To make this change, I had to reboot every server, so it is = taking time (production server :()
So i was hoping to find a quick method to mitigate this issue on domUs= while rebooting servers.

As this bug happens once= or twice per server since October, I can't say that right now that cha= nging platform timer to PIT fixed it. I have to wait (I hope forever!) this= bug to happen again on a 'patched' server ...=A0

But even with clcoksource=3Dpit, I am seeing some warp= =3D3000+ in debug message ? I guess it is not a good sign, is it ?

Jan : I was hoping to find a way to make the domU clocksou= rce more "independent" like with xen3.2.


2011/2/28 Dan Mage= nheimer <dan.magenheimer@oracle.com>

Hi Olivier =96

=A0<= /span>

It is the Xen clocksource th= at you want to try to change, not the dom0 clocksource.=A0 To do this, you = need to specify =93clocksource=3Dpit=94 on the Xen boot line (and reboot), = not the dom0 boot line.

=A0

I believe Mark Adams played with tsc_mode to see if it solved! his (similar? identical?) problem last year, and it didn=92t make any differenc= e.


Please try booting Xen with =93clocksource=3Dpit=94 and ensure = that =93Platform timer is 1.19MHz PIT=94 appears in the Xen boot messages.= =A0 If the 50min jump does not appear again, it would point to a problem in= the hpet, either hardware or software.

=A0

Thanks,

Dan

=A0

From:= Olivier Hanesse [mailto:olivier.hanesse@gmail.com= ]
Sent: Monday, February 28, 2011 7:37 AM
To: Jeremy Fitzhar= dinge
Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; = xen-deve= l@lists.xensource.com; Xen Users; Keir Fraser


Subject: Re: [Xen-devel] Xen 4= TSC problems

=A0

Hello,

=A0

It happened agai= n twice this weekend.

=A0

What about setting "tsc_mode=3D2" for my vms ? Should this = mode prevent this bug (coming from a bad emulated tsc due to firmware issue= ? is it possible ?) from affecting time in domUs ?

=A0

= Setting clocksource=3Dpit, make 'tsc' available in "/sys/devic= es/system/clocksource/clocksource0/available_clocksource" (otherwise o= nly xen is available, is it normal ? ).=A0

=A0

= Should I bypass xen clocksource and use tsc as a clocksource for dom0/domU = ? or =A0will it be worsed ?

=A0

Regards

=A0

=

Olivier

=A0

2011/2/24 Jeremy Fitzhardinge <jeremy@goop.org>

On 02/24/2011 09= :43 AM, Dan Magenheimer wrote:
> Just a wild guess, but this in Olivi= er's posted output:
>
> (XEN) Platform timer appears to hav= e unexpectedly wrapped 10 or more times.
>
> and the fact that a 32-bit HPET wrap is ~300 seconds and, with= the
> "10 or more times", 10 * 300 seconds is 3000 seconds= , might be a clue
> (or a complete red herring, but I thought it wort= h mentioning).
>
> Mark and Olivier, it would be interesting to know if you are> using the same processor/system.

It = definitely seems like some kind of problem on the host system rather
tha= n anything in the guests themselves. ! =A0If the platform timer is
misbehaving, then Xen could be completely screwing= up the pvclock
calibration which it then passes to guests.

Could= it be one of those "platform clock stops in certain power states"= ;
problems?


=A0 =A0J

>= ;> -----Original Message-----
>> From: Keir Fraser [mailto:keir.xen@gmail.com]=
>> Sent: Thursday, February 24, 2011 7:52 AM
>> To: Olivier = Hanesse; Jan Beulich
>> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lis= ts.xensource.com; Xen
>> Users; Dan Magenheimer; Keir Fraser
>> Subject: Re: [Xen-= devel] Xen 4 TSC problems
>>
>> On 24/02/2011 14:20, &quo= t;Olivier Hanesse" <olivier.hanesse@gmail.com>
>> wrote:
>>
>>> Both dom0 and domUs are affected by this= " jump".
>>>
>>> I expect to see somethin= g like "TSC marked as reliable, warp =3D 0".
>>> I go= t this on newer hardware with same config/distros.
>> It depends on the CPU itself, older CPUs do not have the super-sta= ble
>> TSC
>> features. But that should never cause a mas= sive 3000s time jump.
>>
>>> Is there a way to measure= if it is a TSC warp ? to point out a cpu
>> tsc issue ?
>>
>> The TSC warps or out-of-sync i= ssues that we could reasonably expect
>> would be
>> on t= he order of microseconds. A 3000s warp is something else entirely.
>&= gt; Xen
>> is very confused and/or some TSC or platform timer has jumped a lo= ng
>> way
>> (indicating a hardware/firmware issue).
&= gt;>
>> =A0-- Keir
>>
>&gt! ;> 2011/2/24 Jan Beulich <JBeulich@novell.com>

>>>>&g= t;>> On 24.02.11 at 12:57, Olivier Hanesse <olivier.hanesse@gmail.com><= br> >> wrote:
>>>>> I tried to turn off cstates with ma= x_cstate=3D0 without success
>> (still "not
>>>&g= t;> reliable").
>>>>>
>>>>> Wit= h cpuidle=3D0, I also got :
>>>>>
>>>>> (XEN) TSC has constant rate, d= eep Cstates possible, so not
>> reliable,
>>>>> = warp=3D3022 (count=3D1)
>>>> This message by itself isn'= t telling much I believe.
>>>>
>>>>> xm info | grep command
>>= >>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle=3D0 l= oglvl=3Dall
>> guest_loglvl=3Dall
>>>>> do= m0_max_vcpus=3D1 dom0_vcp! us_pin console=3Dvga,com1 com1=3D19200,8n1

>>>>&g= t;
>>>>> Keir :
>>>>>
>>>&g= t;> Using clocksource=3Dpit :
>>>>>
>>>>= ;> (XEN) Platform timer is 1.193MHz PIT
>>>>>
>>>>> I also got :
>>>&g= t;>
>>>>> (XEN) TSC has constant rate, deep Cstates po= ssible, so not
>> reliable,
>>>>> warp=3D3262 (c= ount=3D2)
>>>> The question is whether any of this eliminates the time ju= mps seen
>>>> by your DomU-s (from your past mails I wasn= 9;t actually sure whether
>>>> Dom0 also experienced this pr= oblem, albeit it would be odd if it
>> didn't).
>>>> Jan
>>>>
>&g= t;>> Jan
>>>>
>>>
>>

=

=A0


--0015175cb0c6bbce5f049d59411e-- --===============1988755835== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users --===============1988755835==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: RE: [Xen-devel] Xen 4 TSC problems Date: Mon, 28 Feb 2011 07:30:30 -0800 (PST) Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <49766a9d-0e37-46d2-9497-33c2e981a871@default AANLkTikptc2POrKQgJuoVZRdwJTo64DJ_hm12KuPky4D@mail.gmail.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1891310359==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Olivier Hanesse Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1891310359== Content-Type: multipart/alternative; boundary="__129890705141018905abhmt020" --__129890705141018905abhmt020 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Hi Olivier - =20 By "warp=3D3000+ in debug message" do you mean the Xen boot message "TSC h= as constant rate..., warp =3D NNNN"? =20 If so, this is a very different "warp" measured in cycles, not in seconds, = so 3000 is more like a microsecond not an hour, and this is normal (not a b= ad sign). =20 Dan =20 From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com]=20 Sent: Monday, February 28, 2011 8:23 AM To: Dan Magenheimer Cc: Jeremy Fitzhardinge; Keir Fraser; Jan Beulich; Mark Adams; xen-devel@li= sts.xensource.com; Xen Users; Keir Fraser Subject: Re: [Xen-devel] Xen 4 TSC problems =20 Keir :=20 =20 Yes, it is "under progress".=20 To make this change, I had to reboot every server, so it is taking time (pr= oduction server :() So i was hoping to find a quick method to mitigate this issue on domUs whil= e rebooting servers. =20 As this bug happens once or twice per server since October, I can't say tha= t right now that changing platform timer to PIT fixed it. I have to wait (I= hope forever!) this bug to happen again on a 'patched' server ...=20 =20 But even with clcoksource=3Dpit, I am seeing some warp=3D3000+ in debug mes= sage ? I guess it is not a good sign, is it ? =20 Jan : I was hoping to find a way to make the domU clocksource more "indepen= dent" like with xen3.2. =20 =20 2011/2/28 Dan Magenheimer Hi Olivier - =20 It is the Xen clocksource that you want to try to change, not the dom0 cloc= ksource. To do this, you need to specify "clocksource=3Dpit" on the Xen bo= ot line (and reboot), not the dom0 boot line. =20 I believe Mark Adams played with tsc_mode to see if it solved! his (similar= ? identical?) problem last year, and it didn't make any difference. Please try booting Xen with "clocksource=3Dpit" and ensure that "Platform t= imer is 1.19MHz PIT" appears in the Xen boot messages. If the 50min jump d= oes not appear again, it would point to a problem in the hpet, either hardw= are or software. =20 Thanks, Dan =20 From: Olivier Hanesse [mailto:HYPERLINK "mailto:olivier.hanesse@gmail.com" = \nolivier.hanesse@gmail.com]=20 Sent: Monday, February 28, 2011 7:37 AM To: Jeremy Fitzhardinge Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; HYPERLINK "mailt= o:xen-devel@lists.xensource.com" \nxen-devel@lists.xensource.com; Xen Users= ; Keir Fraser Subject: Re: [Xen-devel] Xen 4 TSC problems =20 Hello, =20 It happened again twice this weekend. =20 What about setting "tsc_mode=3D2" for my vms ? Should this mode prevent thi= s bug (coming from a bad emulated tsc due to firmware issue ? is it possibl= e ?) from affecting time in domUs ? =20 Setting clocksource=3Dpit, make 'tsc' available in "/sys/devices/system/clo= cksource/clocksource0/available_clocksource" (otherwise only xen is availab= le, is it normal ? ).=20 =20 Should I bypass xen clocksource and use tsc as a clocksource for dom0/domU = ? or will it be worsed ? =20 Regards =20 Olivier =20 2011/2/24 Jeremy Fitzhardinge On 02/24/2011 09:43 AM, Dan Magenheimer wrote: > Just a wild guess, but this in Olivier's posted output: > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more time= s. > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > (or a complete red herring, but I thought it worth mentioning). > > Mark and Olivier, it would be interesting to know if you are > using the same processor/system. It definitely seems like some kind of problem on the host system rather than anything in the guests themselves. ! If the platform timer is misbehaving, then Xen could be completely screwing up the pvclock calibration which it then passes to guests. Could it be one of those "platform clock stops in certain power states" problems? J >> -----Original Message----- >> From: Keir Fraser [mailto:HYPERLINK "mailto:keir.xen@gmail.com" \nkeir.x= en@gmail.com] >> Sent: Thursday, February 24, 2011 7:52 AM >> To: Olivier Hanesse; Jan Beulich >> Cc: Mark Adams; Jeremy Fitzhardinge; HYPERLINK "mailto:xen-devel@lists.x= ensource.com" \nxen-devel@lists.xensource.com; Xen >> Users; Dan Magenheimer; Keir Fraser >> Subject: Re: [Xen-devel] Xen 4 TSC problems >> >> On 24/02/2011 14:20, "Olivier Hanesse" >> wrote: >> >>> Both dom0 and domUs are affected by this" jump". >>> >>> I expect to see something like "TSC marked as reliable, warp =3D 0". >>> I got this on newer hardware with same config/distros. >> It depends on the CPU itself, older CPUs do not have the super-stable >> TSC >> features. But that should never cause a massive 3000s time jump. >> >>> Is there a way to measure if it is a TSC warp ? to point out a cpu >> tsc issue ? >> >> The TSC warps or out-of-sync issues that we could reasonably expect >> would be >> on the order of microseconds. A 3000s warp is something else entirely. >> Xen >> is very confused and/or some TSC or platform timer has jumped a long >> way >> (indicating a hardware/firmware issue). >> >> -- Keir >> >>! ;> 2011/2/24 Jan Beulich >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse >> wrote: >>>>> I tried to turn off cstates with max_cstate=3D0 without success >> (still "not >>>>> reliable"). >>>>> >>>>> With cpuidle=3D0, I also got : >>>>> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >> reliable, >>>>> warp=3D3022 (count=3D1) >>>> This message by itself isn't telling much I believe. >>>> >>>>> xm info | grep command >>>>> xen_commandline : dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall >> guest_loglvl=3Dall >>>>> dom0_max_vcpus=3D1 dom0_vcp! us_pin console=3Dvga,com1 com1=3D19200,8= n1 >>>>> >>>>> Keir : >>>>> >>>>> Using clocksource=3Dpit : >>>>> >>>>> (XEN) Platform timer is 1.193MHz PIT >>>>> >>>>> I also got : >>>>> >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >> reliable, >>>>> warp=3D3262 (count=3D2) >>>> The question is whether any of this eliminates the time jumps seen >>>> by your DomU-s (from your past mails I wasn't actually sure whether >>>> Dom0 also experienced this problem, albeit it would be odd if it >> didn't). >>>> Jan >>>> >>>> Jan >>>> >>> >> =20 =20 --__129890705141018905abhmt020 Content-Type: text/html; charset=us-ascii Content-Transfer-Encoding: quoted-printable

Hi Ol= ivier –

&nb= sp;

By “warp=3D3000+ = in  debug message” do you mean the Xen boot message “TSC= has constant rate..., warp =3D NNNN”?

 

<= span style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#= 1F497D'>If so, this is a very different “warp” measured in cy= cles, not in seconds, so 3000 is more like a microsecond not an hour, ! =20 and this is normal (not a bad sign).

 

Dan<= /o:p>

 

From: Olivier Hanesse [ma= ilto:olivier.hanesse@gmail.com]
Sent: Monday, February 28, 201= 1 8:23 AM
To: Dan Magenheimer
Cc: Jeremy Fitzhardinge= ; Keir Fraser; Jan Beulich; Mark Adams; xen-devel@lists.xensource.com; Xen Users; Keir Fraser
Subject: = Re: [Xen-devel] Xen 4 TSC problems

 

Keir :&nbs= p;

 

Yes, it is "under progress". =

To make this change, I had= to reboot every server, so it is taking time (production server :()=

So i was hoping to find a quick= method to mitigate this issue on domUs while rebooting servers.

 

As this bug happens once or twice per server since Octo= ber, I can't say that right now that changing platform timer to PIT fixed= it. I have to wait (I hope forever!) this bug to happen again on a 'patc= hed' server ... 

 

Bu= t even with clcoksource=3Dpit, I am seeing some warp=3D3000+ in debug mes= sage ? I guess it is not a good sign, is it ?

 

J= an : I was hoping to find a way to make the domU clocksource more "i= ndependent" like with xen3.2.

 

 <= /o:p>

2011/2/28 Dan Magenheimer <dan.magenheimer@oracle.com>

Hi Olivier –

 

It is the Xen clocks= ource that you want to try to change, not the dom0 clocksource.  To = do this, you need to specify “clocksource=3Dpit” on the Xen b= oot line (and reboot), not the dom0 boot line.

 

I believe M= ark Adams played with tsc_mode to see if it solved! his (similar? identic= al?) problem last year, and it didn’t make any difference.


= Please try booting Xen with “clocksource=3Dpit” and ensu! =20 re that “Platform timer is 1.19MHz PIT” appears in the Xen boot messa= ges.  If the 50min jump does not appear again, it would point to a p= roblem in the hpet, either hardware or software.

 

Thanks,

Dan

 

From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com]
Se= nt: Monday, February 28, 2011 7:37 AM
To: Jeremy Fitzhardin= ge
Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; <= a href=3D"mailto:xen-devel@lists.xensource.com" target=3D"_blank">xen-dev= el@lists.xensource.com; Xen Users; Keir Fraser

<= div>


Subject: Re: [Xen-devel] Xen 4 T= SC problems

&nbs= p;

Hello,

 <= /o:p>

It happened again twice this weekend.<= /p>

 

What about sett= ing "tsc_mode=3D2" for my vms ? Should this mode prevent this b= ug (coming from a bad emulated tsc due to firmware issue ? is it possible= ?) from affecting time in domUs ?

&nbs= p;

Setting clocksource=3Dpit, make 'tsc= ' available in "/sys/devices/system/clocksource/clocksource0/availab= le_clocksource" (otherwise only xen is available, is it norma! =20 l ? ). 

 

Should I bypass xen clocksource and use tsc as a clocksource f= or dom0/domU ? or  will it be worsed ?

 

Regards

 

Olivier

 

=

2011/2/24 Je= remy Fitzhardinge <jeremy@goop.org>

On 02/24/2011 09:43 AM, Da= n Magenheimer wrote:
> Just a wild guess, but this in Olivier's pos= ted output:
>
> (XEN) Platform timer appears to have unexpect= edly wrapped 10 or more times.
>
> and the fact that a 32-bit= HPET wrap is ~300 seconds and, with the
> "10 or more times&q= uot;, 10 * 300 seconds is 3000 seconds, might be a clue
> (or a com= plete red herring, but I thought it worth mentioning).
>
> Ma= rk and Olivier, it would be interesting to know if you are
> using = the same processor/system.

It definitely see= ms like some kind of problem on the host system rather
than anything in the guests themselves. !  If the platform= timer is
misbehaving, then Xen could be completely screwing up the pv= clock
calibration which it then passes to guests.

Could it be o= ne of those "platform clock stops in certain power states"
p= roblems?


   J

>> -----Original Message-----
>&= gt; From: Keir Fraser [mailto:keir.xen@gmail.com]
>> Sent: Thursday, February= 24, 2011 7:52 AM
>> To: Olivier Hanesse; Jan Beulich
>>= ; Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xen>> Users; Dan Magenheimer; Keir Fraser
>> Subject: Re: [= Xen-devel] Xen 4 TSC problems
>>
>> On 24/02/2011 14:20= , "Olivier Hanesse" <olivier.hanes= se@gmail.com>
>> wrote:
>>
>>> Both = dom0 and domUs are affected by this" jump".
>>>
= >>> I expect to see something like "TSC marked as reliable,= warp =3D 0".
>>> I got this on newer hardware with same= config/distros.
>> It depends on the CPU itself, older CPUs do = not have the super-stable
>> TSC
>> features. But that = should never cause a massive 3000s time jump.
>>
>>>= Is there a way to measure if it is a TSC warp ? to point out a cpu
&g= t;> tsc issue ?
>>
>> The TSC warps or out-of-sync i= ssues that we could reasonably expect
>> would be
>> on= the order of microseconds. A 3000s warp is something else entirely.
&= gt;> Xen
>> is very confused and/or some TSC or platform time= r has jumped a long
>> way
>> =20 (indicating a hardware/firmware issue).
>>
>>  -- = Keir
>>

>&= gt! ;> 2011/2/24 Jan Beulich <JBeulich@novell.com>


>>>>>>> On 24.02.11 at 12:57, Oli= vier Hanesse <olivier.hanesse@gmail.com>
>> wrote:
>>&g= t;>> I tried to turn off cstates with max_cstate=3D0 without succes= s
>> (still "not
>>>>> reliable").>>>>>
>>>>> With cpuidle=3D0, I also g= ot :
>>>>>
>>>>> (XEN) TSC has consta= nt rate, deep Cstates possible, so not
>> reliable,
>>&= gt;>> warp=3D3022 (count=3D1)
>>>> This message by i= tself isn't telling much I believe.
>>>>
>>>>> xm info | grep comma= nd
>>>>> xen_commandline        : d= om0_mem=3D512M cpuidle=3D0 loglvl=3Dall
>> guest_loglvl=3Dall

>>>>> dom0_max_vcpu= s=3D1 dom0_vcp! us_pin console=3Dvga,com1 com1=3D19200,8n1

=


>>>>>
>>>>>= ; Keir :
>>>>>
>>>>> Using clocksourc= e=3Dpit :
>>>>>
>>>>> (XEN) Platform = timer is 1.193MHz PIT
>>>>>
>>>>> I a= lso got :
>>>>>
>>>>> (XEN) TSC has c= onstant rate, deep Cstates possible, so not
>> reliable,
>= >>>> warp=3D3262 (count=3D2)
>>>> The question= is whether any of this eliminates the time jumps seen
>>>>= ; by your DomU-s (from your past mails I wasn't actually sure whether
>>>> Dom0 also experienced this problem, albeit it= would be odd if it
>> didn't).
>>>> Jan
>&= gt;>>
>>>> Jan
>>>>
>>>>>

 

 

--__129890705141018905abhmt020-- --===============1891310359== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-users mailing list Xen-users@lists.xensource.com http://lists.xensource.com/xen-users --===============1891310359==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Mon, 28 Feb 2011 15:39:48 +0000 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset="ISO-8859-1" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xensource.com Errors-To: xen-users-bounces@lists.xensource.com To: Olivier Hanesse , Dan Magenheimer Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 28/02/2011 15:23, "Olivier Hanesse" wrote: > Keir :=A0 >=20 > Yes, it is "under progress".=A0 > To make this change, I had to reboot every server, so it is taking time > (production server :() > So i was hoping to find a quick method to mitigate this issue on domUs wh= ile > rebooting servers. >=20 > As this bug happens once or twice per server since October, I can't say t= hat > right now that changing platform timer to PIT fixed it. I have to wait (I= hope > forever!) this bug to happen again on a 'patched' server ...=A0 >=20 > But even with clcoksource=3Dpit, I am seeing some warp=3D3000+ in debug messa= ge ? > I guess it is not a good sign, is it ? Better not to have it, but honestly you're very unlikely to see any problem from it. It's totally unrelated to the 3000-second time jumps. -- Keir > Jan : I was hoping to find a way to make the domU clocksource more > "independent" like with xen3.2. >=20 >=20 > 2011/2/28 Dan Magenheimer >> Hi Olivier =AD >> =A0 >> It is the Xen clocksource that you want to try to change, not the dom0 >> clocksource.=A0 To do this, you need to specify =B3clocksource=3Dpit=B2 on the X= en >> boot line (and reboot), not the dom0 boot line. >> =A0 >> I believe Mark Adams played with tsc_mode to see if it solved! his (simi= lar? >> identical?) problem last year, and it didn=B9t make any difference. >>=20 >> Please try booting Xen with =B3clocksource=3Dpit=B2 and ensure that =B3Platform = timer >> is 1.19MHz PIT=B2 appears in the Xen boot messages.=A0 If the 50min jump doe= s not >> appear again, it would point to a problem in the hpet, either hardware o= r >> software. >> =A0 >> Thanks, >> Dan >> =A0 >>=20 >> From: Olivier Hanesse [mailto:olivier.hanesse@gmail.com] >> Sent: Monday, February 28, 2011 7:37 AM >> To: Jeremy Fitzhardinge >> Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; >> xen-devel@lists.xensource.com; Xen Users; Keir Fraser >>=20 >>=20 >> Subject: Re: [Xen-devel] Xen 4 TSC problems >> =A0 >>=20 >> Hello, >>=20 >> =A0 >> It happened again twice this weekend. >>=20 >> =A0 >>=20 >> What about setting "tsc_mode=3D2" for my vms ? Should this mode prevent th= is >> bug (coming from a bad emulated tsc due to firmware issue ? is it possib= le ?) >> from affecting time in domUs ? >>=20 >> =A0 >>=20 >> Setting clocksource=3Dpit, make 'tsc' available in >> "/sys/devices/system/clocksource/clocksource0/available_clocksource" >> (otherwise only xen is available, is it normal ? ).=A0 >>=20 >> =A0 >>=20 >> Should I bypass xen clocksource and use tsc as a clocksource for dom0/do= mU ? >> or =A0will it be worsed ? >>=20 >> =A0 >>=20 >> Regards >>=20 >> =A0 >>=20 >> Olivier >>=20 >> =A0 >>=20 >> 2011/2/24 Jeremy Fitzhardinge >>=20 >> On 02/24/2011 09:43 AM, Dan Magenheimer wrote: >>> Just a wild guess, but this in Olivier's posted output: >>>=20 >>> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more ti= mes. >>>=20 >>> and the fact that a 32-bit HPET wrap is ~300 seconds and, with the >>> "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue >>> (or a complete red herring, but I thought it worth mentioning). >>>=20 >>> Mark and Olivier, it would be interesting to know if you are >>> using the same processor/system. >> It definitely seems like some kind of problem on the host system rather >> than anything in the guests themselves. ! =A0If the platform timer is >> misbehaving, then Xen could be completely screwing up the pvclock >> calibration which it then passes to guests. >>=20 >> Could it be one of those "platform clock stops in certain power states" >> problems? >>=20 >>=20 >> =A0 =A0J >>=20 >>>> -----Original Message----- >>>> From: Keir Fraser [mailto:keir.xen@gmail.com] >>>> Sent: Thursday, February 24, 2011 7:52 AM >>>> To: Olivier Hanesse; Jan Beulich >>>> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xe= n >>>> Users; Dan Magenheimer; Keir Fraser >>>> Subject: Re: [Xen-devel] Xen 4 TSC problems >>>>=20 >>>> On 24/02/2011 14:20, "Olivier Hanesse" >>>> wrote: >>>>=20 >>>>> Both dom0 and domUs are affected by this" jump". >>>>>=20 >>>>> I expect to see something like "TSC marked as reliable, warp =3D 0". >>>>> I got this on newer hardware with same config/distros. >>>> It depends on the CPU itself, older CPUs do not have the super-stable >>>> TSC >>>> features. But that should never cause a massive 3000s time jump. >>>>=20 >>>>> Is there a way to measure if it is a TSC warp ? to point out a cpu >>>> tsc issue ? >>>>=20 >>>> The TSC warps or out-of-sync issues that we could reasonably expect >>>> would be >>>> on the order of microseconds. A 3000s warp is something else entirely. >>>> Xen >>>> is very confused and/or some TSC or platform timer has jumped a long >>>> way >>>> (indicating a hardware/firmware issue). >>>>=20 >>>> =A0-- Keir >>>>=20 >>> >! ;> 2011/2/24 Jan Beulich >>=20 >>>>>>>>> On 24.02.11 at 12:57, Olivier Hanesse >>>> wrote: >>>>>>> I tried to turn off cstates with max_cstate=3D0 without success >>>> (still "not >>>>>>> reliable"). >>>>>>>=20 >>>>>>> With cpuidle=3D0, I also got : >>>>>>>=20 >>>>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >>>> reliable, >>>>>>> warp=3D3022 (count=3D1) >>>>>> This message by itself isn't telling much I believe. >>>>>>=20 >>>>>>> xm info | grep command >>>>>>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall >>>> guest_loglvl=3Dall >>>>>>> dom0_max_vcpus=3D1 dom0_vcp! us_pin console=3Dvga,com1 com1=3D19200,8n1 >>=20 >>>>>>>=20 >>>>>>> Keir : >>>>>>>=20 >>>>>>> Using clocksource=3Dpit : >>>>>>>=20 >>>>>>> (XEN) Platform timer is 1.193MHz PIT >>>>>>>=20 >>>>>>> I also got : >>>>>>>=20 >>>>>>> (XEN) TSC has constant rate, deep Cstates possible, so not >>>> reliable, >>>>>>> warp=3D3262 (count=3D2) >>>>>> The question is whether any of this eliminates the time jumps seen >>>>>> by your DomU-s (from your past mails I wasn't actually sure whether >>>>>> Dom0 also experienced this problem, albeit it would be odd if it >>>> didn't). >>>>>> Jan >>>>>>=20 >>>>>> Jan >>>>>>=20 >>>>>=20 >>>>=20 >> =A0 >=20 >=20 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: Xen 4 TSC problems Date: Mon, 28 Feb 2011 16:54:29 +0100 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0467043827==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Dan Magenheimer Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============0467043827== Content-Type: multipart/alternative; boundary=90e6ba6e80fef3c66d049d59b13d --90e6ba6e80fef3c66d049d59b13d Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Yes this is what I mean. I am glad to hear that it isn't a bad sign :) I thought of a bad sign, because on system with "reliable TSC", this counte= r is always 0. 2011/2/28 Dan Magenheimer > Hi Olivier =96 > > > > By =93warp=3D3000+ in debug message=94 do you mean the Xen boot message = =93TSC has > constant rate..., warp =3D NNNN=94? > > > > If so, this is a very different =93warp=94 measured in cycles, not in sec= onds, > so 3000 is more like a microsecond not an hour, ! and this is normal (not= a > bad sign). > > > > Dan > > > > *From:* Olivier Hanesse [mailto:olivier.hanesse@gmail.com] > *Sent:* Monday, February 28, 2011 8:23 AM > *To:* Dan Magenheimer > *Cc:* Jeremy Fitzhardinge; Keir Fraser; Jan Beulich; Mark Adams; > xen-devel@lists.xensource.com; Xen Users; Keir Fraser > > *Subject:* Re: [Xen-devel] Xen 4 TSC problems > > > > Keir : > > > > Yes, it is "under progress". > > To make this change, I had to reboot every server, so it is taking time > (production server :() > > So i was hoping to find a quick method to mitigate this issue on domUs > while rebooting servers. > > > > As this bug happens once or twice per server since October, I can't say > that right now that changing platform timer to PIT fixed it. I have to wa= it > (I hope forever!) this bug to happen again on a 'patched' server ... > > > > But even with clcoksource=3Dpit, I am seeing some warp=3D3000+ in debug m= essage > ? I guess it is not a good sign, is it ? > > > > Jan : I was hoping to find a way to make the domU clocksource more > "independent" like with xen3.2. > > > > > > 2011/2/28 Dan Magenheimer > > Hi Olivier =96 > > > > It is the Xen clocksource that you want to try to change, not the dom0 > clocksource. To do this, you need to specify =93clocksource=3Dpit=94 on = the Xen > boot line (and reboot), not the dom0 boot line. > > > > I believe Mark Adams played with tsc_mode to see if it solved! his > (similar? identical?) problem last year, and it didn=92t make any differe= nce. > > > Please try booting Xen with =93clocksource=3Dpit=94 and ensu! re that =93= Platform > timer is 1.19MHz PIT=94 appears in the Xen boot messages. If the 50min j= ump > does not appear again, it would point to a problem in the hpet, either > hardware or software. > > > > Thanks, > > Dan > > > > *From:* Olivier Hanesse [mailto:olivier.hanesse@gmail.com] > *Sent:* Monday, February 28, 2011 7:37 AM > *To:* Jeremy Fitzhardinge > *Cc:* Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; > xen-devel@lists.xensource.com; Xen Users; Keir Fraser > > > *Subject:* Re: [Xen-devel] Xen 4 TSC problems > > > > Hello, > > > > It happened again twice this weekend. > > > > What about setting "tsc_mode=3D2" for my vms ? Should this mode prevent t= his > bug (coming from a bad emulated tsc due to firmware issue ? is it possibl= e > ?) from affecting time in domUs ? > > > > Setting clocksource=3Dpit, make 'tsc' available in > "/sys/devices/system/clocksource/clocksource0/available_clocksource" > (otherwise only xen is available, is it norma! l ? ). > > > > Should I bypass xen clocksource and use tsc as a clocksource for dom0/dom= U > ? or will it be worsed ? > > > > Regards > > > > Olivier > > > > 2011/2/24 Jeremy Fitzhardinge > > On 02/24/2011 09:43 AM, Dan Magenheimer wrote: > > Just a wild guess, but this in Olivier's posted output: > > > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more > times. > > > > and the fact that a 32-bit HPET wrap is ~300 seconds and, with the > > "10 or more times", 10 * 300 seconds is 3000 seconds, might be a clue > > (or a complete red herring, but I thought it worth mentioning). > > > > Mark and Olivier, it would be interesting to know if you are > > using the same processor/system. > > It definitely seems like some kind of problem on the host system rather > than anything in the guests themselves. ! If the platform timer is > misbehaving, then Xen could be completely screwing up the pvclock > calibration which it then passes to guests. > > Could it be one of those "platform clock stops in certain power states" > problems? > > > J > > >> -----Original Message----- > >> From: Keir Fraser [mailto:keir.xen@gmail.com] > >> Sent: Thursday, February 24, 2011 7:52 AM > >> To: Olivier Hanesse; Jan Beulich > >> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Xe= n > >> Users; Dan Magenheimer; Keir Fraser > >> Subject: Re: [Xen-devel] Xen 4 TSC problems > >> > >> On 24/02/2011 14:20, "Olivier Hanesse" > >> wrote: > >> > >>> Both dom0 and domUs are affected by this" jump". > >>> > >>> I expect to see something like "TSC marked as reliable, warp =3D 0". > >>> I got this on newer hardware with same config/distros. > >> It depends on the CPU itself, older CPUs do not have the super-stable > >> TSC > >> features. But that should never cause a massive 3000s time jump. > >> > >>> Is there a way to measure if it is a TSC warp ? to point out a cpu > >> tsc issue ? > >> > >> The TSC warps or out-of-sync issues that we could reasonably expect > >> would be > >> on the order of microseconds. A 3000s warp is something else entirely. > >> Xen > >> is very confused and/or some TSC or platform timer has jumped a long > >> way > >> (indicating a hardware/firmware issue). > >> > >> -- Keir > >> > > >>! ;> 2011/2/24 Jan Beulich > > > >>>>>>> On 24.02.11 at 12:57, Olivier Hanesse > >> wrote: > >>>>> I tried to turn off cstates with max_cstate=3D0 without success > >> (still "not > >>>>> reliable"). > >>>>> > >>>>> With cpuidle=3D0, I also got : > >>>>> > >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not > >> reliable, > >>>>> warp=3D3022 (count=3D1) > >>>> This message by itself isn't telling much I believe. > >>>> > >>>>> xm info | grep command > >>>>> xen_commandline : dom0_mem=3D512M cpuidle=3D0 loglvl=3Dall > >> guest_loglvl=3Dall > > >>>>> dom0_max_vcpus=3D1 dom0_vcp! us_pin console=3Dvga,com1 com1=3D19200= ,8n1 > > > >>>>> > >>>>> Keir : > >>>>> > >>>>> Using clocksource=3Dpit : > >>>>> > >>>>> (XEN) Platform timer is 1.193MHz PIT > >>>>> > >>>>> I also got : > >>>>> > >>>>> (XEN) TSC has constant rate, deep Cstates possible, so not > >> reliable, > >>>>> warp=3D3262 (count=3D2) > >>>> The question is whether any of this eliminates the time jumps seen > >>>> by your DomU-s (from your past mails I wasn't actually sure whether > >>>> Dom0 also experienced this problem, albeit it would be odd if it > >> didn't). > >>>> Jan > >>>> > >>>> Jan > >>>> > >>> > >> > > > > > --90e6ba6e80fef3c66d049d59b13d Content-Type: text/html; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Ye= s this is what I mean.
I am glad to hear that it isn't a bad sign := )
I thought of a bad sign, because on system with "reliable TS= C", this counter is always 0.

2011/2/28 Dan Magenheimer <= span dir=3D"ltr"><dan.mage= nheimer@oracle.com>

Hi Olivier =96

=A0<= /span>

By =93warp=3D3000+ in=A0 deb= ug message=94 do you mean the Xen boot message =93TSC has constant rate...,= warp =3D NNNN=94?

=A0

If so= , this is a very different =93warp=94 measured in cycles, not in seconds, s= o 3000 is more like a microsecond not an hour, ! and this is normal (not a bad sign).

=A0

Dan

=A0

From: Olivier Hanesse [mai= lto:olivier.= hanesse@gmail.com]
Sent: Monday, February 28, 2011 8:23 AM
To: Dan Magenheime= r
Cc: Jeremy Fitzhardinge; Keir Fraser; Jan Beulich; Mark Adams; xen-deve= l@lists.xensource.com; Xen Users; Keir Fraser


Subject: Re: [Xen-devel] Xen 4 TSC problems

=A0

Keir :=A0

=A0

Yes, it is "under progress".=A0

To make this change, I had to reboot every server, so it is taking time (pr= oduction server :()

So i was hoping to= find a quick method to mitigate this issue on domUs while rebooting server= s.

=A0

= As this bug happens once or twice per server since October, I can't say= that right now that changing platform timer to PIT fixed it. I have to wai= t (I hope forever!) this bug to happen again on a 'patched' server = ...=A0

=A0

= But even with clcoksource=3Dpit, I am seeing some warp=3D3000+ in debug mes= sage ? I guess it is not a good sign, is it ?

=A0

Jan : I was hoping to find a way t= o make the domU clocksource more "independent" like with xen3.2.<= /p>

=A0

=A0

2011/2/28 Dan Mag= enheimer <dan.magenheimer@oracle.com>

<= p class=3D"MsoNormal"> Hi Olivier =96

=A0

It is the Xen clocksource that you want to try to change, not the dom0 clo= cksource.=A0 To do this, you need to specify =93clocksource=3Dpit=94 on the= Xen boot line (and reboot), not the dom0 boot line.

=A0

I believe Mark Adams played with tsc_mode to see if it solved! his (sim= ilar? identical?) problem last year, and it didn=92t make any difference.


Please try booting Xen with =93clocksource=3Dpit=94 and ensu! re that =93Platform timer is 1.19MHz PIT=94 appears in the Xen boot messages.=A0 If= the 50min jump does not appear again, it would point to a problem in the h= pet, either hardware or software.

=A0

Thank= s,

Dan

=A0

From: Ol= ivier Hanesse [mailto:olivier.hanesse@gmail.com]
Sent: Monday, February 28, 2011 7:37 AM
To: Jeremy Fitzhar= dinge
Cc: Dan Magenheimer; Keir Fraser; Jan Beulich; Mark Adams; = xen-deve= l@lists.xensource.com; Xen Users; Keir Fraser


Subject: Re: [Xen-devel] Xen 4 = TSC problems

<= p class=3D"MsoNormal">=A0

Hello,

=A0

It happened again twice this weekend.

=A0

What= about setting "tsc_mode=3D2" for my vms ? Should this mode preve= nt this bug (coming from a bad emulated tsc due to firmware issue ? is it p= ossible ?) from affecting time in domUs ?

=A0

Setting clocksource=3Dpit, make 'tsc' available in "/sys= /devices/system/clocksource/clocksource0/available_clocksource" (other= wise only xen is available, is it norma! l ? ).=A0

=A0

Should I bypass xen clocksource = and use tsc as a clocksource for dom0/domU ? or =A0will it be worsed ?

<= /div>

=A0

Regards

=

=A0

= Olivier

=

=A0

2011/2/24 Jeremy Fitzhardinge = <jeremy@goop.org>

It de= finitely seems like some kind of problem on the host system rather
than anything in the guests themselves. ! =A0If the platform time= r is
misbehaving, then Xen could be completely screwing up the pvclockcalibration which it then passes to guests.

Could it be one of tho= se "platform clock stops in certain power states"
problems?


=A0= =A0J

>> -----Original Message-----
>> From: Keir Fra= ser [mailto:
keir.xe= n@gmail.com]
>> Sent: Thursday, February 24, 2011 7:52 AM
>> To: Olivier = Hanesse; Jan Beulich
>> Cc: Mark Adams; Jeremy Fitzhardinge; xen-devel@lis= ts.xensource.com; Xen
>> Users; Dan Magenheimer; Keir Fraser
>> Subject: Re: [Xen-= devel] Xen 4 TSC problems
>>
>> On 24/02/2011 14:20, &quo= t;Olivier Hanesse" <olivier.hanesse@gmail.com>
>> wrote:
>>
>>> Both dom0 and domUs are affecte= d by this" jump".
>>>
>>> I expect to see= something like "TSC marked as reliable, warp =3D 0".
>>= > I got this on newer hardware with same config/distros.
>> It depends on the CPU itself, older CPUs do not have the super-sta= ble
>> TSC
>> features. But that should never cause a mas= sive 3000s time jump.
>>
>>> Is there a way to measure= if it is a TSC warp ? to point out a cpu
>> tsc issue ?
>>
>> The TSC warps or out-of-sync i= ssues that we could reasonably expect
>> would be
>> on t= he order of microseconds. A 3000s warp is something else entirely.
>&= gt; Xen
>> is very confused and/or some TSC or platform timer has jumped a lo= ng
>> way
>> (indicating a hardware/firmware issue).
>>
>> =A0-- Keir<= br>>>

>&gt! ;> 2011/2= /24 Jan Beulich <JBeulich@novell.com>


>>>>>>> On 24.02.11 at= 12:57, Olivier Hanesse <olivier.hanesse@gmail.com>
>> wrote:
>= ;>>>> I tried to turn off cstates with max_cstate=3D0 without s= uccess
>> (still "not
>>>>> reliable").
>&= gt;>>>
>>>>> With cpuidle=3D0, I also got :
&= gt;>>>>
>>>>> (XEN) TSC has constant rate, de= ep Cstates possible, so not
>> reliable,
>>>>> warp=3D3022 (count=3D1)
>&= gt;>> This message by itself isn't telling much I believe.
>>>>
>>>>> xm info | grep command=
>>>>> xen_commandline =A0 =A0 =A0 =A0: dom0_mem=3D512M c= puidle=3D0 loglvl=3Dall
>> guest_loglvl=3Dall

>>>>> dom0_max_vcpus=3D1 dom0_vcp! us_pin consol= e=3Dvga,com1 com1=3D19200,8n1


>>>>>
>>>>>= ; Keir :
>>>>>
>>>>> Using clocksource= =3Dpit :
>>>>>
>>>>> (XEN) Platform tim= er is 1.193MHz PIT
>>>>>
>>>>> I also got :
>>>&g= t;>
>>>>> (XEN) TSC has constant rate, deep Cstates po= ssible, so not
>> reliable,
>>>>> warp=3D3262 (c= ount=3D2)
>>>> The question is whether any of this eliminates the time ju= mps seen
>>>> by your DomU-s (from your past mails I wasn= 9;t actually sure whether
>>>> Dom0 also experienced this problem, albeit it w= ould be odd if it
>> didn't).
>>>> Jan
>&= gt;>>
>>>> Jan
>>>>
>>>
>>

=A0

=A0

=

--90e6ba6e80fef3c66d049d59b13d-- --===============0467043827== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel --===============0467043827==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: "andre.arnold" Subject: Re: Xen 4 TSC problems Date: Fri, 15 Apr 2011 00:51:46 -0700 (PDT) Message-ID: <1302853906683-4304962.post@n5.nabble.com> References: <4D665F3102000078000337B6@vpn.id2.novell.com> <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <4D66AB26.9020000@goop.org> <49766a9d-0e37-46d2-9497-33c2e981a871@default> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org Hi, we experienced the same problem with xen 4 under debian squeeze on our DELL PowerEdge R815 Servers. Does the "clocksource=pit" setting solve the problem? Cheers Andre -- View this message in context: http://xen.1045712.n5.nabble.com/Xen-4-TSC-problems-tp3396848p4304962.html Sent from the Xen - Dev mailing list archive at Nabble.com. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: Re: Xen 4 TSC problems Date: Fri, 15 Apr 2011 18:31:43 +0200 Message-ID: References: <4D665F3102000078000337B6@vpn.id2.novell.com> <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <4D66AB26.9020000@goop.org> <49766a9d-0e37-46d2-9497-33c2e981a871@default> <1302853906683-4304962.post@n5.nabble.com> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0276316951==" Return-path: In-Reply-To: <1302853906683-4304962.post@n5.nabble.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: "andre.arnold" Cc: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org --===============0276316951== Content-Type: multipart/alternative; boundary=bcaec51b1b4113b71c04a0f794ff --bcaec51b1b4113b71c04a0f794ff Content-Type: text/plain; charset=ISO-8859-1 So far yes. Let's wait another month to tell that setting clocksource=pit was the solution :) 2011/4/15 andre.arnold > Hi, > > we experienced the same problem with xen 4 under debian squeeze on > our DELL PowerEdge R815 Servers. > > Does the "clocksource=pit" setting solve the problem? > > Cheers > Andre > > -- > View this message in context: > http://xen.1045712.n5.nabble.com/Xen-4-TSC-problems-tp3396848p4304962.html > Sent from the Xen - Dev mailing list archive at Nabble.com. > > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xensource.com > http://lists.xensource.com/xen-devel > --bcaec51b1b4113b71c04a0f794ff Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable So far yes.
Let's wait another month to tell that setting clocksour= ce=3Dpit was the solution :)


2011/4/15 andre.arnold <andre.arnold@gmail.com>
Hi,

we experienced the same problem with xen 4 under debian squeeze on
our DELL PowerEdge R815 Servers.

Does the "clocksource=3Dpit" setting solve the problem?

Cheers
Andre

--
View this message in context: http://xen.1045= 712.n5.nabble.com/Xen-4-TSC-problems-tp3396848p4304962.html
Sent from the Xen - Dev mailing list archive at Nabble.com.

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.= com
http://l= ists.xensource.com/xen-devel

--bcaec51b1b4113b71c04a0f794ff-- --===============0276316951== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xensource.com http://lists.xensource.com/xen-devel --===============0276316951==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Philippe Simonet Subject: Re: Xen 4 TSC problems Date: Tue, 13 Sep 2011 09:16:27 +0200 Message-ID: <4E6F034B.5040102@bluewin.ch> References: Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org Hi Xen developers i just would like to inform you that I have exactly the same problem with Debian squeeze and xen, with 50 seconds time jump on my dom0 and domu. NTP is running on all dom0/domuU, clocksource is 'xen' everywhere. some messages : syslog : Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc unstable (delta = -2999662111513 ns) xm dmesg : ... (XEN) Platform timer is 14.318MHz HPET ... (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. (XEN) TSC marked as reliable, warp = 0 (count=2) ... I had some contact with Olivier Hanesse and it indicates that he doesn't have any solution for this problem, and all what was proposed in February didn't solved this problem. all suggestions are welcomed. Best regards Philippe config : -------------------------------------- Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 12:46:30 UTC 2011 x86_64 GNU/Linux -------------------------------------- HP DL385 -------------------------------------- vendor_id : AuthenticAMD cpu family : 16 model : 9 model name : AMD Opteron(tm) Processor 6174 stepping : 1 cpu MHz : 3058776.574 cache size : 512 KB fpu : yes fpu_exception : yes cpuid level : 5 wp : yes flags : fpu de tsc msr pae mce cx8 apic mtrr mca cmov pat clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt lm 3dnowext 3dnow constant_tsc rep_good nonstop_tsc extd_apicid amd_dcm pni cx16 popcnt hypervisor lahf_lm cmp_legacy extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch nodeid_msr bogomips : 4409.03 TLB size : 1024 4K pages clflush size : 64 cache_alignment : 64 address sizes : 48 bits physical, 48 bits virtual power management: ts ttp tm stc 100mhzsteps hwpstate -------------------------------------- -------------------------------------- PCI : 00:00.0 Host bridge: ATI Technologies Inc RD890 Northbridge only dual slot (2x16) PCI-e GFX Hydra part (rev 02) 00:04.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (PCI express gpp port D) 00:06.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (PCI express gpp port F) 00:0a.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (external gfx1 port A) 00:0b.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (NB-SB link) 00:0d.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (external gfx1 port B) 00:11.0 SATA controller: ATI Technologies Inc SB700/SB800 SATA Controller [IDE mode] 00:12.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0 Controller 00:12.1 USB Controller: ATI Technologies Inc SB700 USB OHCI1 Controller 00:12.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller 00:13.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0 Controller 00:13.1 USB Controller: ATI Technologies Inc SB700 USB OHCI1 Controller 00:13.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Controller 00:14.0 SMBus: ATI Technologies Inc SBx00 SMBus Controller (rev 3d) 00:14.1 IDE interface: ATI Technologies Inc SB700/SB800 IDE Controller 00:14.3 ISA bridge: ATI Technologies Inc SB700/SB800 LPC host controller 00:14.4 PCI bridge: ATI Technologies Inc SBx00 PCI to PCI Bridge 00:18.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor HyperTransport Configuration 00:18.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Address Map 00:18.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DRAM Controller 00:18.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Miscellaneous Control 00:18.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Link Control 00:19.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor HyperTransport Configuration 00:19.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Address Map 00:19.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DRAM Controller 00:19.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Miscellaneous Control 00:19.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Link Control 00:1a.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor HyperTransport Configuration 00:1a.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Address Map 00:1a.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DRAM Controller 00:1a.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Miscellaneous Control 00:1a.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Link Control 00:1b.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor HyperTransport Configuration 00:1b.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Address Map 00:1b.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DRAM Controller 00:1b.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Miscellaneous Control 00:1b.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Link Control 01:03.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02) 02:00.0 System peripheral: Hewlett-Packard Company iLO3 Slave instrumentation & System support (rev 04) 02:00.2 System peripheral: Hewlett-Packard Company iLO3 Management Processor Support and Messaging (rev 04) 02:00.4 USB Controller: Hewlett-Packard Company Proliant iLO2/iLO3 virtual USB controller (rev 01) 03:00.0 RAID bus controller: Hewlett-Packard Company Smart Array G6 controllers (rev 01) 04:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet (rev 20) 04:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet (rev 20) 05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet (rev 20) 05:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 Gigabit Ethernet (rev 20) 09:00.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev bb) 0a:04.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev bb) 0a:05.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev bb) 0a:06.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev bb) 0c:00.0 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0c:00.1 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0c:00.2 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0c:00.3 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0f:00.0 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0f:00.1 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0f:00.2 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) 0f:00.3 Ethernet controller: Intel Corporation 82580 Gigabit Network Connection (rev 01) -------------------------------------- On 8:59 PM, Olivier Hanesse wrote: > Hello > > I've got an issue about time keeping with Xen 4.0 (Debian squeeze release). > > My problem is here (hopefully I amn't the only one, so there might be a bug > somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=599161#50 > After some times, I got this error : Clocksource tsc unstable (delta = > -2999660334211 ns). It has happened on several servers. > > Looking at the output of "xm debug-key s;" > > (XEN) TSC has constant rate, deep Cstates possible, so not reliable, > warp=2850 (count=3) > > I am using a "Intel(R) Xeon(R) CPU L5420 @ 2.50GHz", which has the > "constant_tsc", but not the "nonstop_tsc" one. > On other systems with a newer cpu with "nonstop_tsc", I don't have this > issue (systems are running the same distros with same config). > > I tried to boot with "max_cstate=0", but nothing changed, my TSC isn't > reliable and after some times, I will got the "50min" issue again. > > I don't understand how a system can do a jump of "50min" in the future. Why > 50min ? it is not 40min, not 1 hour, it is always 50min. > I don't know how to make my TSC "reliable" (I already disable everything > about Powerstate in BIOS Settings). > > Any ideas ? > > Regards > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Konrad Rzeszutek Wilk Subject: Re: Xen 4 TSC problems Date: Thu, 15 Sep 2011 04:23:35 -0400 Message-ID: <20110915082335.GB24888@phenom.oracle.com> References: <4E6F034B.5040102@bluewin.ch> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Content-Disposition: inline In-Reply-To: <4E6F034B.5040102@bluewin.ch> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Philippe Simonet Cc: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org On Tue, Sep 13, 2011 at 09:16:27AM +0200, Philippe Simonet wrote: > Hi Xen developers > > i just would like to inform you that I have exactly the same problem > with Debian squeeze and xen, with > 50 seconds time jump on my dom0 and domu. NTP is running on all > dom0/domuU, clocksource is 'xen' > everywhere. > > some messages : > syslog : > Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc > unstable (delta = -2999662111513 ns) > > xm dmesg : > ... > (XEN) Platform timer is 14.318MHz HPET > ... > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. > (XEN) TSC marked as reliable, warp = 0 (count=2) > ... > > I had some contact with Olivier Hanesse and it indicates that he > doesn't have any solution for this problem, > and all what was proposed in February didn't solved this problem. Which was the max_cstate=0 ? .. > config : > -------------------------------------- > Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 > 12:46:30 UTC 2011 x86_64 GNU/Linux > -------------------------------------- > HP DL385 > -------------------------------------- > vendor_id : AuthenticAMD > cpu family : 16 > model : 9 > model name : AMD Opteron(tm) Processor 6174 > stepping : 1 > cpu MHz : 3058776.574 OK, that is really messed up. Your house must be on fire for the machine to be running at 3058GHz! Jeremy, this sounds familiar - did we have a patch for this in your 2.6.32 tree? From mboxrd@z Thu Jan 1 00:00:00 1970 From: Konrad Rzeszutek Wilk Subject: Re: Xen 4 TSC problems Date: Thu, 15 Sep 2011 04:24:02 -0400 Message-ID: <20110915082402.GC24888@phenom.oracle.com> References: <4E6F034B.5040102@bluewin.ch> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Content-Disposition: inline In-Reply-To: <4E6F034B.5040102@bluewin.ch> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Philippe Simonet , Jeremy Fitzhardinge Cc: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org On Tue, Sep 13, 2011 at 09:16:27AM +0200, Philippe Simonet wrote: > Hi Xen developers Lets try this again, this time Cc-ing Jeremy. > > i just would like to inform you that I have exactly the same problem > with Debian squeeze and xen, with > 50 seconds time jump on my dom0 and domu. NTP is running on all > dom0/domuU, clocksource is 'xen' > everywhere. > > some messages : > syslog : > Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc > unstable (delta = -2999662111513 ns) > > xm dmesg : > ... > (XEN) Platform timer is 14.318MHz HPET > ... > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. > (XEN) TSC marked as reliable, warp = 0 (count=2) > ... > > I had some contact with Olivier Hanesse and it indicates that he > doesn't have any solution for this problem, > and all what was proposed in February didn't solved this problem. Which was the max_cstate=0 ? .. > config : > -------------------------------------- > Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 > 12:46:30 UTC 2011 x86_64 GNU/Linux > -------------------------------------- > HP DL385 > -------------------------------------- > vendor_id : AuthenticAMD > cpu family : 16 > model : 9 > model name : AMD Opteron(tm) Processor 6174 > stepping : 1 > cpu MHz : 3058776.574 OK, that is really messed up. Your house must be on fire for the machine to be running at 3058GHz! Jeremy, this sounds familiar - did we have a patch for this in your 2.6.32 tree? From mboxrd@z Thu Jan 1 00:00:00 1970 From: George Dunlap Subject: Re: Xen 4 TSC problems Date: Thu, 15 Sep 2011 11:36:13 +0100 Message-ID: References: <4E6F034B.5040102@bluewin.ch> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <4E6F034B.5040102@bluewin.ch> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Philippe Simonet Cc: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org On Tue, Sep 13, 2011 at 8:16 AM, Philippe Simonet wrote: > Hi Xen developers > > i just would like to inform you that I have exactly the same problem with > Debian squeeze and xen, with > 50 seconds time jump on my dom0 and domu. NTP is running on all dom0/domu= U, > clocksource is 'xen' > everywhere. > > some messages : > syslog : > Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc unstable > (delta =3D -2999662111513 ns) > > xm dmesg : > ... > (XEN) Platform timer is 14.318MHz HPET > ... > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more time= s. > (XEN) TSC marked as reliable, warp =3D 0 (count=3D2) > ... I haven't been following this conversation, so I don't know if this is relevant, but I've just discovered this morning that the TSC warp check in Xen is done at the wrong time (before any secondary cpus are brought up), and thus always returns warp=3D0. I've submitted a patch to do the check after secondary CPUs are brought up; that should cause Xen to do periodic synchronization of TSCs when there is drift. -George > > I had some contact with Olivier Hanesse and it indicates that he doesn't > have any solution for this problem, > and all what was proposed in February didn't solved this problem. > > all suggestions are welcomed. > > Best regards > > Philippe > > > config : > -------------------------------------- > Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 12:46:30 U= TC > 2011 x86_64 GNU/Linux > -------------------------------------- > HP DL385 > -------------------------------------- > vendor_id =A0 =A0 =A0 : AuthenticAMD > cpu family =A0 =A0 =A0: 16 > model =A0 =A0 =A0 =A0 =A0 : 9 > model name =A0 =A0 =A0: AMD Opteron(tm) Processor 6174 > stepping =A0 =A0 =A0 =A0: 1 > cpu MHz =A0 =A0 =A0 =A0 : 3058776.574 > cache size =A0 =A0 =A0: 512 KB > fpu =A0 =A0 =A0 =A0 =A0 =A0 : yes > fpu_exception =A0 : yes > cpuid level =A0 =A0 : 5 > wp =A0 =A0 =A0 =A0 =A0 =A0 =A0: yes > flags =A0 =A0 =A0 =A0 =A0 : fpu de tsc msr pae mce cx8 apic mtrr mca cmov= pat clflush > mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt lm 3dnowext 3dnow > constant_tsc rep_good nonstop_tsc extd_apicid amd_dcm pni cx16 popcnt > hypervisor lahf_lm cmp_legacy extapic cr8_legacy abm sse4a misalignsse > 3dnowprefetch nodeid_msr > bogomips =A0 =A0 =A0 =A0: 4409.03 > TLB size =A0 =A0 =A0 =A0: 1024 4K pages > clflush size =A0 =A0: 64 > cache_alignment : 64 > address sizes =A0 : 48 bits physical, 48 bits virtual > power management: ts ttp tm stc 100mhzsteps hwpstate > -------------------------------------- > > -------------------------------------- > PCI : > 00:00.0 Host bridge: ATI Technologies Inc RD890 Northbridge only dual slo= t > (2x16) PCI-e GFX Hydra part (rev 02) > 00:04.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (PCI > express gpp port D) > 00:06.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (PCI > express gpp port F) > 00:0a.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (externa= l > gfx1 port A) > 00:0b.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (NB-SB > link) > 00:0d.0 PCI bridge: ATI Technologies Inc RD890 PCI to PCI bridge (externa= l > gfx1 port B) > 00:11.0 SATA controller: ATI Technologies Inc SB700/SB800 SATA Controller > [IDE mode] > 00:12.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0 > Controller > 00:12.1 USB Controller: ATI Technologies Inc SB700 USB OHCI1 Controller > 00:12.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Control= ler > 00:13.0 USB Controller: ATI Technologies Inc SB700/SB800 USB OHCI0 > Controller > 00:13.1 USB Controller: ATI Technologies Inc SB700 USB OHCI1 Controller > 00:13.2 USB Controller: ATI Technologies Inc SB700/SB800 USB EHCI Control= ler > 00:14.0 SMBus: ATI Technologies Inc SBx00 SMBus Controller (rev 3d) > 00:14.1 IDE interface: ATI Technologies Inc SB700/SB800 IDE Controller > 00:14.3 ISA bridge: ATI Technologies Inc SB700/SB800 LPC host controller > 00:14.4 PCI bridge: ATI Technologies Inc SBx00 PCI to PCI Bridge > 00:18.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > HyperTransport Configuration > 00:18.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Address Map > 00:18.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DR= AM > Controller > 00:18.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Miscellaneous Control > 00:18.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Li= nk > Control > 00:19.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > HyperTransport Configuration > 00:19.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Address Map > 00:19.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DR= AM > Controller > 00:19.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Miscellaneous Control > 00:19.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Li= nk > Control > 00:1a.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > HyperTransport Configuration > 00:1a.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Address Map > 00:1a.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DR= AM > Controller > 00:1a.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Miscellaneous Control > 00:1a.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Li= nk > Control > 00:1b.0 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > HyperTransport Configuration > 00:1b.1 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Address Map > 00:1b.2 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor DR= AM > Controller > 00:1b.3 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor > Miscellaneous Control > 00:1b.4 Host bridge: Advanced Micro Devices [AMD] Family 10h Processor Li= nk > Control > 01:03.0 VGA compatible controller: ATI Technologies Inc ES1000 (rev 02) > 02:00.0 System peripheral: Hewlett-Packard Company iLO3 Slave > instrumentation & System support (rev 04) > 02:00.2 System peripheral: Hewlett-Packard Company iLO3 Management Proces= sor > Support and Messaging (rev 04) > 02:00.4 USB Controller: Hewlett-Packard Company Proliant iLO2/iLO3 virtua= l > USB controller (rev 01) > 03:00.0 RAID bus controller: Hewlett-Packard Company Smart Array G6 > controllers (rev 01) > 04:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 > Gigabit Ethernet (rev 20) > 04:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 > Gigabit Ethernet (rev 20) > 05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 > Gigabit Ethernet (rev 20) > 05:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709 > Gigabit Ethernet (rev 20) > 09:00.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI > Express Gen 2 (5.0 GT/s) Switch (rev bb) > 0a:04.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI > Express Gen 2 (5.0 GT/s) Switch (rev bb) > 0a:05.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI > Express Gen 2 (5.0 GT/s) Switch (rev bb) > 0a:06.0 PCI bridge: PLX Technology, Inc. PEX 8616 16-lane, 4-Port PCI > Express Gen 2 (5.0 GT/s) Switch (rev bb) > 0c:00.0 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0c:00.1 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0c:00.2 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0c:00.3 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0f:00.0 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0f:00.1 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0f:00.2 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > 0f:00.3 Ethernet controller: Intel Corporation 82580 Gigabit Network > Connection (rev 01) > -------------------------------------- > > > > > On 8:59 PM, Olivier Hanesse wrote: >> >> Hello >> >> I've got an issue about time keeping with Xen 4.0 (Debian squeeze >> release). >> >> My problem is here (hopefully I amn't the only one, so there might be a >> bug >> somewhere) : http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=3D599161#5= 0 >> After some times, =A0I got this error : Clocksource tsc unstable (delta = =3D >> -2999660334211 ns). It has happened on several servers. >> >> Looking at the output of "xm debug-key s;" >> >> (XEN) TSC has constant rate, deep Cstates possible, so not reliable, >> warp=3D2850 (count=3D3) >> >> I am using a "Intel(R) Xeon(R) CPU L5420 =A0@ 2.50GHz", which has the >> "constant_tsc", but not the "nonstop_tsc" one. >> On other systems with a newer cpu with "nonstop_tsc", I don't have this >> issue (systems are running the same distros with same config). >> >> I tried to boot with "max_cstate=3D0", but nothing changed, my TSC isn't >> reliable and after some times, I will got the "50min" issue again. >> >> I don't understand how a system can do a jump of "50min" in the future. >> Why >> 50min ? it is not 40min, not 1 hour, it is always 50min. >> I don't know how to make my TSC "reliable" (I already disable everything >> about Powerstate in BIOS Settings). >> >> Any ideas ? >> >> Regards >> > > > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xensource.com > http://lists.xensource.com/xen-devel > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeremy Fitzhardinge Subject: Re: Xen 4 TSC problems Date: Thu, 15 Sep 2011 09:24:43 -0700 Message-ID: <4E7226CB.2020706@goop.org> References: <4E6F034B.5040102@bluewin.ch> <20110915082402.GC24888@phenom.oracle.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <20110915082402.GC24888@phenom.oracle.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Konrad Rzeszutek Wilk Cc: xen-devel@lists.xensource.com, Philippe Simonet List-Id: xen-devel@lists.xenproject.org On 09/15/2011 01:24 AM, Konrad Rzeszutek Wilk wrote: > On Tue, Sep 13, 2011 at 09:16:27AM +0200, Philippe Simonet wrote: >> Hi Xen developers > Lets try this again, this time Cc-ing Jeremy. >> i just would like to inform you that I have exactly the same problem >> with Debian squeeze and xen, with >> 50 seconds time jump on my dom0 and domu. NTP is running on all >> dom0/domuU, clocksource is 'xen' >> everywhere. >> >> some messages : >> syslog : >> Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc >> unstable (delta = -2999662111513 ns) >> >> xm dmesg : >> ... >> (XEN) Platform timer is 14.318MHz HPET >> ... >> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more times. >> (XEN) TSC marked as reliable, warp = 0 (count=2) >> ... >> >> I had some contact with Olivier Hanesse and it indicates that he >> doesn't have any solution for this problem, >> and all what was proposed in February didn't solved this problem. That looks like Xen itself is having problems keeping track of time. If it can't manage it, then there's not much the guest kernels can do about it. > > Which was the max_cstate=0 ? > .. >> config : >> -------------------------------------- >> Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 >> 12:46:30 UTC 2011 x86_64 GNU/Linux >> -------------------------------------- >> HP DL385 >> -------------------------------------- >> vendor_id : AuthenticAMD >> cpu family : 16 >> model : 9 >> model name : AMD Opteron(tm) Processor 6174 >> stepping : 1 >> cpu MHz : 3058776.574 > OK, that is really messed up. Your house must be on fire for the machine > to be running at 3058GHz! > > Jeremy, this sounds familiar - did we have a patch for this in > your 2.6.32 tree? Not that I can think of. All I can suggest from the kernel side is that perhaps some of the ACPI power stuff isn't being set up properly, and that makes the CPU do very strange things with its TSC/power states in general. J From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: RE: Xen 4 TSC problems Date: Thu, 15 Sep 2011 11:38:41 -0700 (PDT) Message-ID: <5224b434-5371-404d-8fed-2665e73dacce@default> References: <4E6F034B.5040102@bluewin.ch CAFLBxZZiGxef5CkH9r+VZeB4h7LPA0nJddumwDRovZaL4zOuvA@mail.gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: George Dunlap , Philippe Simonet Cc: jeremy@goop.org, xen-devel@lists.xensource.com, Keir Fraser , Konrad Wilk List-Id: xen-devel@lists.xenproject.org > From: George Dunlap [mailto:George.Dunlap@eu.citrix.com] > Sent: Thursday, September 15, 2011 4:36 AM > To: Philippe Simonet > Cc: xen-devel@lists.xensource.com > Subject: Re: [Xen-devel] Xen 4 TSC problems >=20 > On Tue, Sep 13, 2011 at 8:16 AM, Philippe Simonet > wrote: > > Hi Xen developers > > > > i just would like to inform you that I have exactly the same problem wi= th > > Debian squeeze and xen, with > > 50 seconds time jump on my dom0 and domu. NTP is running on all dom0/do= muU, > > clocksource is 'xen' > > everywhere. > > > > some messages : > > syslog : > > Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc unstabl= e > > (delta =3D -2999662111513 ns) > > > > xm dmesg : > > ... > > (XEN) Platform timer is 14.318MHz HPET > > ... > > (XEN) Platform timer appears to have unexpectedly wrapped 10 or more ti= mes. > > (XEN) TSC marked as reliable, warp =3D 0 (count=3D2) > > ... >=20 > I haven't been following this conversation, so I don't know if this is > relevant, but I've just discovered this morning that the TSC warp > check in Xen is done at the wrong time (before any secondary cpus are > brought up), and thus always returns warp=3D0. I've submitted a patch > to do the check after secondary CPUs are brought up; that should cause > Xen to do periodic synchronization of TSCs when there is drift. Wow, nice catch, George! I wonder if this is the underlying bug for many of the mysterious time problems that have been reported for a year or two now... at least on certain AMD boxes. Any idea when this was introduced? Or has it always been wrong? Dan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Subject: RE: Xen 4 TSC problems Date: Fri, 16 Sep 2011 06:03:22 +0000 Message-ID: References: <4E6F034B.5040102@bluewin.ch> <20110915082402.GC24888@phenom.oracle.com> <4E7226CB.2020706@goop.org> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <4E7226CB.2020706@goop.org> Content-Language: en-US List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: jeremy@goop.org, konrad.wilk@oracle.com Cc: xen-devel@lists.xensource.com, philippe.simonet@bluewin.ch List-Id: xen-devel@lists.xenproject.org > -----Original Message----- > From: xen-devel-bounces@lists.xensource.com [mailto:xen-devel- > bounces@lists.xensource.com] On Behalf Of Jeremy Fitzhardinge > Sent: Thursday, September 15, 2011 6:25 PM > To: Konrad Rzeszutek Wilk > Cc: xen-devel@lists.xensource.com; Philippe Simonet > Subject: Re: [Xen-devel] Xen 4 TSC problems >=20 > On 09/15/2011 01:24 AM, Konrad Rzeszutek Wilk wrote: > > On Tue, Sep 13, 2011 at 09:16:27AM +0200, Philippe Simonet wrote: > >> Hi Xen developers > > Lets try this again, this time Cc-ing Jeremy. > >> i just would like to inform you that I have exactly the same problem > >> with Debian squeeze and xen, with > >> 50 seconds time jump on my dom0 and domu. NTP is running on all > >> dom0/domuU, clocksource is 'xen' > >> everywhere. > >> > >> some messages : > >> syslog : > >> Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc > >> unstable (delta =3D -2999662111513 ns) > >> > >> xm dmesg : > >> ... > >> (XEN) Platform timer is 14.318MHz HPET ... > >> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more > times. > >> (XEN) TSC marked as reliable, warp =3D 0 (count=3D2) ... > >> > >> I had some contact with Olivier Hanesse and it indicates that he > >> doesn't have any solution for this problem, and all what was proposed > >> in February didn't solved this problem. >=20 > That looks like Xen itself is having problems keeping track of time. If = it can't > manage it, then there's not much the guest kernels can do about it. >=20 > > > > Which was the max_cstate=3D0 ? > > .. > >> config : > >> -------------------------------------- > >> Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 > >> 12:46:30 UTC 2011 x86_64 GNU/Linux > >> -------------------------------------- > >> HP DL385 > >> -------------------------------------- > >> vendor_id : AuthenticAMD > >> cpu family : 16 > >> model : 9 > >> model name : AMD Opteron(tm) Processor 6174 > >> stepping : 1 > >> cpu MHz : 3058776.574 > > OK, that is really messed up. Your house must be on fire for the > > machine to be running at 3058GHz! > > > > Jeremy, this sounds familiar - did we have a patch for this in your > > 2.6.32 tree? >=20 > Not that I can think of. All I can suggest from the kernel side is that = perhaps > some of the ACPI power stuff isn't being set up properly, and that makes = the > CPU do very strange things with its TSC/power states in general. >=20 how can i detect that ?=20 the /proc/acpi/processor path is empty,=20 find /proc/acpi /proc/acpi /proc/acpi/processor /proc/acpi/button /proc/acpi/button/power /proc/acpi/button/power/PWRF /proc/acpi/button/power/PWRF/info /proc/acpi/thermal_zone /proc/acpi/wakeup /proc/acpi/sleep /proc/acpi/fadt /proc/acpi/dsdt /proc/acpi/info /proc/acpi/power_resource /proc/acpi/embedded_controller dmesg | grep -I acpi [ 1.205647] hpet_acpi_add: no address or irqs in _CRS lsmod | grep -i acpi acpi_processor 5087 1 processor,[permanent] > J >=20 >=20 > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xensource.com > http://lists.xensource.com/xen-devel From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jeremy Fitzhardinge Subject: Re: Xen 4 TSC problems Date: Fri, 16 Sep 2011 15:40:18 -0700 Message-ID: <4E73D052.3020901@goop.org> References: <4E6F034B.5040102@bluewin.ch> <20110915082402.GC24888@phenom.oracle.com> <4E7226CB.2020706@goop.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Philippe.Simonet@swisscom.com Cc: xen-devel@lists.xensource.com, philippe.simonet@bluewin.ch, konrad.wilk@oracle.com List-Id: xen-devel@lists.xenproject.org On 09/15/2011 11:03 PM, Philippe.Simonet@swisscom.com wrote: >> -----Original Message----- >> From: xen-devel-bounces@lists.xensource.com [mailto:xen-devel- >> bounces@lists.xensource.com] On Behalf Of Jeremy Fitzhardinge >> Sent: Thursday, September 15, 2011 6:25 PM >> To: Konrad Rzeszutek Wilk >> Cc: xen-devel@lists.xensource.com; Philippe Simonet >> Subject: Re: [Xen-devel] Xen 4 TSC problems >> >> On 09/15/2011 01:24 AM, Konrad Rzeszutek Wilk wrote: >>> On Tue, Sep 13, 2011 at 09:16:27AM +0200, Philippe Simonet wrote: >>>> Hi Xen developers >>> Lets try this again, this time Cc-ing Jeremy. >>>> i just would like to inform you that I have exactly the same problem >>>> with Debian squeeze and xen, with >>>> 50 seconds time jump on my dom0 and domu. NTP is running on all >>>> dom0/domuU, clocksource is 'xen' >>>> everywhere. >>>> >>>> some messages : >>>> syslog : >>>> Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc >>>> unstable (delta = -2999662111513 ns) >>>> >>>> xm dmesg : >>>> ... >>>> (XEN) Platform timer is 14.318MHz HPET ... >>>> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more >> times. >>>> (XEN) TSC marked as reliable, warp = 0 (count=2) ... >>>> >>>> I had some contact with Olivier Hanesse and it indicates that he >>>> doesn't have any solution for this problem, and all what was proposed >>>> in February didn't solved this problem. >> That looks like Xen itself is having problems keeping track of time. If it can't >> manage it, then there's not much the guest kernels can do about it. >> >>> Which was the max_cstate=0 ? >>> .. >>>> config : >>>> -------------------------------------- >>>> Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 >>>> 12:46:30 UTC 2011 x86_64 GNU/Linux >>>> -------------------------------------- >>>> HP DL385 >>>> -------------------------------------- >>>> vendor_id : AuthenticAMD >>>> cpu family : 16 >>>> model : 9 >>>> model name : AMD Opteron(tm) Processor 6174 >>>> stepping : 1 >>>> cpu MHz : 3058776.574 >>> OK, that is really messed up. Your house must be on fire for the >>> machine to be running at 3058GHz! >>> >>> Jeremy, this sounds familiar - did we have a patch for this in your >>> 2.6.32 tree? >> Not that I can think of. All I can suggest from the kernel side is that perhaps >> some of the ACPI power stuff isn't being set up properly, and that makes the >> CPU do very strange things with its TSC/power states in general. >> > how can i detect that ? > > the /proc/acpi/processor path is empty, > > find /proc/acpi > /proc/acpi > /proc/acpi/processor > /proc/acpi/button > /proc/acpi/button/power > /proc/acpi/button/power/PWRF > /proc/acpi/button/power/PWRF/info > /proc/acpi/thermal_zone > /proc/acpi/wakeup > /proc/acpi/sleep > /proc/acpi/fadt > /proc/acpi/dsdt > /proc/acpi/info > /proc/acpi/power_resource > /proc/acpi/embedded_controller > > dmesg | grep -I acpi > [ 1.205647] hpet_acpi_add: no address or irqs in _CRS > > lsmod | grep -i acpi > acpi_processor 5087 1 processor,[permanent] What does "xenpm start 5" say? J From mboxrd@z Thu Jan 1 00:00:00 1970 From: Philippe Simonet Subject: Re: Xen 4 TSC problems Date: Mon, 19 Sep 2011 07:45:22 +0200 Message-ID: <4E76D6F2.5010602@bluewin.ch> References: <4E6F034B.5040102@bluewin.ch> <20110915082402.GC24888@phenom.oracle.com> <4E7226CB.2020706@goop.org> <4E73D052.3020901@goop.org> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <4E73D052.3020901@goop.org> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Jeremy Fitzhardinge Cc: Philippe.Simonet@swisscom.com, xen-devel@lists.xensource.com, konrad.wilk@oracle.com List-Id: xen-devel@lists.xenproject.org On 9/17/2011 12:40 AM, Jeremy Fitzhardinge wrote: > On 09/15/2011 11:03 PM, Philippe.Simonet@swisscom.com wrote: >>> -----Original Message----- >>> From: xen-devel-bounces@lists.xensource.com [mailto:xen-devel- >>> bounces@lists.xensource.com] On Behalf Of Jeremy Fitzhardinge >>> Sent: Thursday, September 15, 2011 6:25 PM >>> To: Konrad Rzeszutek Wilk >>> Cc: xen-devel@lists.xensource.com; Philippe Simonet >>> Subject: Re: [Xen-devel] Xen 4 TSC problems >>> >>> On 09/15/2011 01:24 AM, Konrad Rzeszutek Wilk wrote: >>>> On Tue, Sep 13, 2011 at 09:16:27AM +0200, Philippe Simonet wrote: >>>>> Hi Xen developers >>>> Lets try this again, this time Cc-ing Jeremy. >>>>> i just would like to inform you that I have exactly the same problem >>>>> with Debian squeeze and xen, with >>>>> 50 seconds time jump on my dom0 and domu. NTP is running on all >>>>> dom0/domuU, clocksource is 'xen' >>>>> everywhere. >>>>> >>>>> some messages : >>>>> syslog : >>>>> Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc >>>>> unstable (delta = -2999662111513 ns) >>>>> >>>>> xm dmesg : >>>>> ... >>>>> (XEN) Platform timer is 14.318MHz HPET ... >>>>> (XEN) Platform timer appears to have unexpectedly wrapped 10 or more >>> times. >>>>> (XEN) TSC marked as reliable, warp = 0 (count=2) ... >>>>> >>>>> I had some contact with Olivier Hanesse and it indicates that he >>>>> doesn't have any solution for this problem, and all what was proposed >>>>> in February didn't solved this problem. >>> That looks like Xen itself is having problems keeping track of time. If it can't >>> manage it, then there's not much the guest kernels can do about it. >>> >>>> Which was the max_cstate=0 ? >>>> .. >>>>> config : >>>>> -------------------------------------- >>>>> Linux dnsit22.swissptt.ch 2.6.32-5-xen-amd64 #1 SMP Tue Jun 14 >>>>> 12:46:30 UTC 2011 x86_64 GNU/Linux >>>>> -------------------------------------- >>>>> HP DL385 >>>>> -------------------------------------- >>>>> vendor_id : AuthenticAMD >>>>> cpu family : 16 >>>>> model : 9 >>>>> model name : AMD Opteron(tm) Processor 6174 >>>>> stepping : 1 >>>>> cpu MHz : 3058776.574 >>>> OK, that is really messed up. Your house must be on fire for the >>>> machine to be running at 3058GHz! >>>> >>>> Jeremy, this sounds familiar - did we have a patch for this in your >>>> 2.6.32 tree? >>> Not that I can think of. All I can suggest from the kernel side is that perhaps >>> some of the ACPI power stuff isn't being set up properly, and that makes the >>> CPU do very strange things with its TSC/power states in general. >>> >> how can i detect that ? >> >> the /proc/acpi/processor path is empty, >> >> find /proc/acpi >> /proc/acpi >> /proc/acpi/processor >> /proc/acpi/button >> /proc/acpi/button/power >> /proc/acpi/button/power/PWRF >> /proc/acpi/button/power/PWRF/info >> /proc/acpi/thermal_zone >> /proc/acpi/wakeup >> /proc/acpi/sleep >> /proc/acpi/fadt >> /proc/acpi/dsdt >> /proc/acpi/info >> /proc/acpi/power_resource >> /proc/acpi/embedded_controller >> >> dmesg | grep -I acpi >> [ 1.205647] hpet_acpi_add: no address or irqs in _CRS >> >> lsmod | grep -i acpi >> acpi_processor 5087 1 processor,[permanent] > What does "xenpm start 5" say? > > J > here it is : root@dnsit22.swissptt.ch ~# xenpm start 5 Timeout set to 5 seconds Start sampling, waiting for CTRL-C or SIGINT or SIGALARM signal ... Elapsed time (ms): 5028 CPU0: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU1: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU2: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU3: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU4: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU5: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU6: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU7: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU8: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU9: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU10: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU11: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU12: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU13: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU14: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU15: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU16: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU17: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU18: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU19: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU20: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU21: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU22: Residency(ms) Avg Res(ms) Avg freq 18 KHz CPU23: Residency(ms) Avg Res(ms) Avg freq 18 KHz From mboxrd@z Thu Jan 1 00:00:00 1970 From: George Dunlap Subject: Re: Xen 4 TSC problems Date: Mon, 19 Sep 2011 11:39:49 +0100 Message-ID: References: <5224b434-5371-404d-8fed-2665e73dacce@default> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: <5224b434-5371-404d-8fed-2665e73dacce@default> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Dan Magenheimer Cc: Keir Fraser , jeremy@goop.org, xen-devel@lists.xensource.com, Philippe Simonet , Konrad Wilk List-Id: xen-devel@lists.xenproject.org On Thu, Sep 15, 2011 at 7:38 PM, Dan Magenheimer wrote: >> I haven't been following this conversation, so I don't know if this is >> relevant, but I've just discovered this morning that the TSC warp >> check in Xen is done at the wrong time (before any secondary cpus are >> brought up), and thus always returns warp=3D0. =A0I've submitted a patch >> to do the check after secondary CPUs are brought up; that should cause >> Xen to do periodic synchronization of TSCs when there is drift. > > Wow, nice catch, George! =A0I wonder if this is the underlying bug > for many of the mysterious time problems that have been reported > for a year or two now... at least on certain AMD boxes. > Any idea when this was introduced? =A0Or has it always been wrong? Well the comment in 20823:89907dab1aef seems to indicate that's where the "assume it's reliable on AMD until proven otherwise" started; that would be January 2010. I looked as far back as 20705:a74aca4b9386, and there the TSC reliability checks were again in init_xen_time(). Figuring out where things were before then is getting into archeology. :-) The comment at the top of init_xen_time() is correct now, but from the time it was first written through 4.1 is was just plain wrong -- it said init_xen_time() happened after all cpus were up, which has never been true. -George From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: Xen 4 TSC problems Date: Thu, 22 Sep 2011 13:07:21 +0100 Message-ID: <4E7B41190200007800057508@nat28.tlf.novell.com> References: <5224b434-5371-404d-8fed-2665e73dacce@default> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: George Dunlap Cc: jeremy@goop.org, xen-devel@lists.xensource.com, Keir Fraser , Konrad Wilk , Dan Magenheimer , Philippe Simonet List-Id: xen-devel@lists.xenproject.org >>> On 19.09.11 at 12:39, George Dunlap = wrote: > The comment at the top of init_xen_time() is correct now, but from the > time it was first written through 4.1 is was just plain wrong -- it > said init_xen_time() happened after all cpus were up, which has never > been true. Not really - CPUs got booted by that time originally (pre-4.0, which is what the old comment said), but not onlined. Prior to the use of smp_call_function() for the TSC reliability check I assume only CPU feature flags got looked at, which was available at that point prior to Keir's re-work of the SMP boot process. Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Subject: RE: Xen 4 TSC problems Date: Fri, 30 Sep 2011 06:33:00 +0000 Message-ID: References: <5224b434-5371-404d-8fed-2665e73dacce@default> Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: Content-Language: en-US List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: George.Dunlap@eu.citrix.com, dan.magenheimer@oracle.com Cc: olivier.hanesse@gmail.com, jeremy@goop.org, xen-devel@lists.xensource.com, keir@xen.org, konrad.wilk@oracle.com List-Id: xen-devel@lists.xenproject.org Hi Xen developpers i need some good tips to go forward with my TSC problem :=20 first fast the problem :=20 - clock jump 50 minutes forward : (xm dmesg) (XEN) TSC is reliable, synchronization unnecessary (XEN) Platform timer is 14.318MHz HPET (XEN) Platform timer appears to have unexpectedly wrapped 10 or more time= s (syslog) Sep 28 17:45:06 dnsit11 kernel: [1970548.356130] Clocksource tsc unstable = (delta =3D -2999660112689 ns) Sep 11 13:56:50 dnsit22 kernel: [571603.359863] Clocksource tsc unstable (= delta =3D -2999662111513 ns) - I can't reproduce or force the problem - on 2 different HP DL 385 G7, with debian squeeze :=20 xen-hypervisor-4.0-amd64 4.0.1-2 dom0 : linux-image-2.6.32-5-xen-amd64 2.6.32-35 domus : 5 -> 15 debian machines 2 * 12-cores AMD Opteron(tm) Processor 6174 - i have this problem since begin of september, before, the machine were ru= nning since 3 month without problem begin of September, I have done an upgrade (dom0 and domus:) linux-image-2.6.32-5-xen-amd64:amd64 (2.6.32-31, automatic) -> linux-imag= e-2.6.32-5-xen-amd64:amd64 (2.6.32-31, 2.6.32-35) - what is strange : (don't know if there is a link with the problem) /proc/cpuinfo in dom0 gives me :=20 cpu MHz : 3249880.888 --or -- cpu MHz : 2300454.255 .... (different after each reboot) =09 in domu thi value is ok(cpu MHz : 2200.112), the bogomips is also = ok (bogomips : 4400.21) if I start the machine with a non-xen environment, the values are also ok =09 I have now exact the same machine where I can make some tests. Could you give me some tips that I could test or implement ? - hardware problem ? hypervisor problem ? dom0 problem ? - try other hypervisor version ?=20 - try linux-image-3.0.0-1-amd64 3.0.0-3 - try reproducing problem ? (how ?, log it ? ....) all your help is welcomed ! many thanks Philippe > -----Original Message----- > From: xen-devel-bounces@lists.xensource.com [mailto:xen-devel- > bounces@lists.xensource.com] On Behalf Of George Dunlap > Sent: Monday, September 19, 2011 12:40 PM > To: Dan Magenheimer > Cc: Keir Fraser; jeremy@goop.org; xen-devel@lists.xensource.com; Philippe > Simonet; Konrad Wilk > Subject: Re: [Xen-devel] Xen 4 TSC problems >=20 > On Thu, Sep 15, 2011 at 7:38 PM, Dan Magenheimer > wrote: > >> I haven't been following this conversation, so I don't know if this > >> is relevant, but I've just discovered this morning that the TSC warp > >> check in Xen is done at the wrong time (before any secondary cpus are > >> brought up), and thus always returns warp=3D0. =A0I've submitted a pat= ch > >> to do the check after secondary CPUs are brought up; that should > >> cause Xen to do periodic synchronization of TSCs when there is drift. > > > > Wow, nice catch, George! =A0I wonder if this is the underlying bug for > > many of the mysterious time problems that have been reported for a > > year or two now... at least on certain AMD boxes. > > Any idea when this was introduced? =A0Or has it always been wrong? >=20 > Well the comment in 20823:89907dab1aef seems to indicate that's where the > "assume it's reliable on AMD until proven otherwise" started; that would = be > January 2010. >=20 > I looked as far back as 20705:a74aca4b9386, and there the TSC reliability > checks were again in init_xen_time(). Figuring out where things were bef= ore > then is getting into archeology. :-) >=20 > The comment at the top of init_xen_time() is correct now, but from the ti= me > it was first written through 4.1 is was just plain wrong -- it said > init_xen_time() happened after all cpus were up, which has never been tru= e. >=20 > -George >=20 > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xensource.com > http://lists.xensource.com/xen-devel From mboxrd@z Thu Jan 1 00:00:00 1970 From: tommics Subject: RE: Xen 4 TSC problems Date: Fri, 30 Sep 2011 02:36:25 -0700 (PDT) Message-ID: <1317375385187-4856420.post@n5.nabble.com> References: <4E6F034B.5040102@bluewin.ch> <5224b434-5371-404d-8fed-2665e73dacce@default> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: xen-devel@lists.xensource.com List-Id: xen-devel@lists.xenproject.org Hey there, just wanted to report that we also experience the same problem since upgrading to xen 4.0.1 on debian squeeze. We are running latest debian stable release. ii libxenstore3.0 4.0.1-2 =20 ii linux-image-2.6.32-5-xen-amd64 2.6.32-35squeeze2 ii linux-image-xen-amd64 2.6.32+29 =20 ii xen-hypervisor-4.0-amd64 4.0.1-2 =20 ii xen-linux-system-2.6-xen-amd64 2.6.32+29 =20 ii xen-linux-system-2.6.32-5-xen-amd64 2.6.32-35squeeze2 ii xen-qemu-dm-4.0 4.0.1-2 =20 ii xen-tools 4.2-1 =20 ii xen-utils-4.0 4.0.1-2 =20 ii xen-utils-common 4.0.0-1 =20 ii xenstore-utils 4.0.1-2 =20 We also have the clock jumping 50 minutes into future.=20 We are running IBM Blades HS21 XM (Type 7995) with Intel(R) Xeon(R) CPU E5345 @ 2.33GHz. We are also running the same configuration on another machine with Intel(R) Xeon(R) CPU X7550 @ 2.00GHz where we dont experience this problems. Also w= e didnt had those bugs running xen 3.1.0 with 2.6.18-5-xen-amd64 kernel. We currently running one blade with disabled HPET, clocksoure=3Dpit and cpuidle=3D0 and another with HPET on and nothing else configured. The main problem debugging this, is to wait for the next error to appear. Until now both machines run fine without time jumping, but we did see that time jump on the machine which is running with hpet enabled and no other settings.=20 We could help debugging here. Regards Thomas P=C3=B6hler -- View this message in context: http://xen.1045712.n5.nabble.com/Xen-4-TSC-pr= oblems-tp3396848p4856420.html Sent from the Xen - Dev mailing list archive at Nabble.com. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: RE: Xen 4 TSC problems Date: Fri, 30 Sep 2011 10:16:44 -0700 (PDT) Message-ID: References: <5224b434-5371-404d-8fed-2665e73dacce@default> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xensource.com Errors-To: xen-devel-bounces@lists.xensource.com To: Philippe.Simonet@swisscom.com, George.Dunlap@eu.citrix.com Cc: olivier.hanesse@gmail.com, jeremy@goop.org, xen-devel@lists.xensource.com, keir@xen.org, Konrad Wilk List-Id: xen-devel@lists.xenproject.org > From: Philippe.Simonet@swisscom.com [mailto:Philippe.Simonet@swisscom.com= ] > Subject: RE: [Xen-devel] Xen 4 TSC problems >=20 > Hi Xen developpers >=20 > i need some good tips to go forward with my TSC problem : >=20 > Could you give me some tips that I could test or implement ? > =09- try other hypervisor version ? Hi Phillipe -- It would definitely be worthwhile to see if you can reproduce the problem on the latest xen-unstable bits. (Please make sure that the bug George reported below is fixed in your build.) A lot has changed since 4.0.1. Dan P.S. I will be mostly offline for the next week or so... > > -----Original Message----- > > From: xen-devel-bounces@lists.xensource.com [mailto:xen-devel- > > bounces@lists.xensource.com] On Behalf Of George Dunlap > > Sent: Monday, September 19, 2011 12:40 PM > > To: Dan Magenheimer > > Cc: Keir Fraser; jeremy@goop.org; xen-devel@lists.xensource.com; Philip= pe > > Simonet; Konrad Wilk > > Subject: Re: [Xen-devel] Xen 4 TSC problems > > > > On Thu, Sep 15, 2011 at 7:38 PM, Dan Magenheimer > > wrote: > > >> I haven't been following this conversation, so I don't know if this > > >> is relevant, but I've just discovered this morning that the TSC warp > > >> check in Xen is done at the wrong time (before any secondary cpus ar= e > > >> brought up), and thus always returns warp=3D0. =A0I've submitted a p= atch > > >> to do the check after secondary CPUs are brought up; that should > > >> cause Xen to do periodic synchronization of TSCs when there is drift= . > > > > > > Wow, nice catch, George! =A0I wonder if this is the underlying bug fo= r > > > many of the mysterious time problems that have been reported for a > > > year or two now... at least on certain AMD boxes. > > > Any idea when this was introduced? =A0Or has it always been wrong? > > > > Well the comment in 20823:89907dab1aef seems to indicate that's where t= he > > "assume it's reliable on AMD until proven otherwise" started; that woul= d be > > January 2010. > > > > I looked as far back as 20705:a74aca4b9386, and there the TSC reliabili= ty > > checks were again in init_xen_time(). Figuring out where things were b= efore > > then is getting into archeology. :-) > > > > The comment at the top of init_xen_time() is correct now, but from the = time > > it was first written through 4.1 is was just plain wrong -- it said > > init_xen_time() happened after all cpus were up, which has never been t= rue. > > > > -George > > > > _______________________________________________ > > Xen-devel mailing list > > Xen-devel@lists.xensource.com > > http://lists.xensource.com/xen-devel From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Thu, 27 Sep 2012 17:54:48 +0200 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 28 February 2011 16:54, Olivier Hanesse wrote: > Yes this is what I mean. > I am glad to hear that it isn't a bad sign :) > I thought of a bad sign, because on system with "reliable TSC", this counter > is always 0. Hey men. I have exactly the same problem. I have two cluster nodes. Server are two HP Proliant DL 580 G4 with four Quad Core Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. I'm running debian squeeze in dom0s end domUs. xm info: host : xen-p01 release : 2.6.32-5-xen-amd64 version : #1 SMP Sun May 6 08:57:29 UTC 2012 machine : x86_64 nr_cpus : 16 nr_nodes : 1 cores_per_socket : 4 threads_per_core : 1 cpu_mhz : 2400 hw_caps : bfebfbff:20100800:00000000:00000940:0004e3bd:00000000:00000001:00000000 virt_caps : hvm total_memory : 65532 free_memory : 40317 node_to_cpu : node0:0-15 node_to_memory : node0:40317 node_to_dma32_mem : node0:3256 max_node_id : 0 xen_major : 4 xen_minor : 0 xen_extra : .1 xen_caps : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 hvm-3.0-x86_32p hvm-3.0-x86_64 xen_scheduler : credit xen_pagesize : 4096 platform_params : virt_start=0xffff800000000000 xen_changeset : unavailable xen_commandline : placeholder dom0_mem=3072M loglvl=warning guest_loglvl=warning cc_compiler : gcc version 4.4.5 (Debian 4.4.5-8) cc_compile_by : ultrotter cc_compile_domain : debian.org cc_compile_date : Sat Sep 8 19:15:46 UTC 2012 xend_config_format : 4 I'm experiencing weekly a clock jump ahead of about 50 minutes on dom0. I'm seriously in trouble because it cause every time a reboot of one of the two nodes clusters. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dan Magenheimer Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 27 Sep 2012 12:27:53 -0700 (PDT) Message-ID: <65395f62-74e5-4910-b701-8df629c2ce3b@default> References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro , Olivier Hanesse Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org > From: Mauro [mailto:mrsanna1@gmail.com] > Sent: Thursday, September 27, 2012 9:55 AM > To: Olivier Hanesse > Cc: Dan Magenheimer; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Keir Fraser; Jan Beulich; > Keir Fraser; Xen Users; Mark Adams > Subject: Re: [Xen-users] Re: [Xen-devel] Xen 4 TSC problems > > On 28 February 2011 16:54, Olivier Hanesse wrote: > > Yes this is what I mean. > > I am glad to hear that it isn't a bad sign :) > > I thought of a bad sign, because on system with "reliable TSC", this counter > > is always 0. > > Hey men. > I have exactly the same problem. > I have two cluster nodes. > Server are two HP Proliant DL 580 G4 with four Quad Core Intel(R) > Xeon(R) CPU E7330 @ 2.40GHz. > I'm running debian squeeze in dom0s end domUs. Hi Mauro -- There's been a lot of work on clocks since 4.0 (by other Xen developers, not me). I don't think this specific problem was ever reproduced by a developer so I don't think anyone knows if it has been already fixed or not, nor are there any plans to backport all the timer work to 4.0. You might try upgrading your Xen hypervisor to the just-released Xen 4.2 [1] and see if the problem goes away. If the problem still exists in 4.2, it may be easier to get some developer to pay attention to it. It may be specific hardware or processors or power management or firmware or even dom0 kernel, so the first thing to do is try later hypervisor bits. Sorry I can't be more helpful. Good luck! Dan [1] Sorry, I'm not familiar with the 4.0->4.2 upgrade process so you may want to confirm with others. > xm info: > > host : xen-p01 > release : 2.6.32-5-xen-amd64 > version : #1 SMP Sun May 6 08:57:29 UTC 2012 > machine : x86_64 > nr_cpus : 16 > nr_nodes : 1 > cores_per_socket : 4 > threads_per_core : 1 > cpu_mhz : 2400 > hw_caps : > bfebfbff:20100800:00000000:00000940:0004e3bd:00000000:00000001:00000000 > virt_caps : hvm > total_memory : 65532 > free_memory : 40317 > node_to_cpu : node0:0-15 > node_to_memory : node0:40317 > node_to_dma32_mem : node0:3256 > max_node_id : 0 > xen_major : 4 > xen_minor : 0 > xen_extra : .1 > xen_caps : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 > hvm-3.0-x86_32p hvm-3.0-x86_64 > xen_scheduler : credit > xen_pagesize : 4096 > platform_params : virt_start=0xffff800000000000 > xen_changeset : unavailable > xen_commandline : placeholder dom0_mem=3072M loglvl=warning > guest_loglvl=warning > cc_compiler : gcc version 4.4.5 (Debian 4.4.5-8) > cc_compile_by : ultrotter > cc_compile_domain : debian.org > cc_compile_date : Sat Sep 8 19:15:46 UTC 2012 > xend_config_format : 4 > > I'm experiencing weekly a clock jump ahead of about 50 minutes on dom0. > I'm seriously in trouble because it cause every time a reboot of one > of the two nodes clusters. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 27 Sep 2012 23:28:36 +0200 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <65395f62-74e5-4910-b701-8df629c2ce3b@default> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1496806874817463489==" Return-path: In-Reply-To: <65395f62-74e5-4910-b701-8df629c2ce3b@default> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Dan Magenheimer Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Keir Fraser , Mauro , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============1496806874817463489== Content-Type: multipart/alternative; boundary=e89a8f64347c3ab36704cab59f37 --e89a8f64347c3ab36704cab59f37 Content-Type: text/plain; charset=ISO-8859-1 Hello, >>From my point of view, this was a kind of xen hardware "incompatibility/bug" : I was able to reproduce this bug on more than 50 identical servers, but not on another farm of servers with a different hardware. Xen version, Debian Kernel was exactly the same on both farm. Regards Olivier 2012/9/27 Dan Magenheimer > > From: Mauro [mailto:mrsanna1@gmail.com] > > Sent: Thursday, September 27, 2012 9:55 AM > > To: Olivier Hanesse > > Cc: Dan Magenheimer; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; > Keir Fraser; Jan Beulich; > > Keir Fraser; Xen Users; Mark Adams > > Subject: Re: [Xen-users] Re: [Xen-devel] Xen 4 TSC problems > > > > On 28 February 2011 16:54, Olivier Hanesse > wrote: > > > Yes this is what I mean. > > > I am glad to hear that it isn't a bad sign :) > > > I thought of a bad sign, because on system with "reliable TSC", this > counter > > > is always 0. > > > > Hey men. > > I have exactly the same problem. > > I have two cluster nodes. > > Server are two HP Proliant DL 580 G4 with four Quad Core Intel(R) > > Xeon(R) CPU E7330 @ 2.40GHz. > > I'm running debian squeeze in dom0s end domUs. > > Hi Mauro -- > > There's been a lot of work on clocks since 4.0 (by other Xen developers, > not me). I don't think this specific problem was ever reproduced > by a developer so I don't think anyone knows if it has been > already fixed or not, nor are there any plans to backport all the > timer work to 4.0. > > You might try upgrading your Xen hypervisor to the just-released > Xen 4.2 [1] and see if the problem goes away. If the problem still > exists in 4.2, it may be easier to get some developer to pay attention > to it. It may be specific hardware or processors or power > management or firmware or even dom0 kernel, so the first thing > to do is try later hypervisor bits. > > Sorry I can't be more helpful. Good luck! > > Dan > > [1] Sorry, I'm not familiar with the 4.0->4.2 upgrade process > so you may want to confirm with others. > > > xm info: > > > > host : xen-p01 > > release : 2.6.32-5-xen-amd64 > > version : #1 SMP Sun May 6 08:57:29 UTC 2012 > > machine : x86_64 > > nr_cpus : 16 > > nr_nodes : 1 > > cores_per_socket : 4 > > threads_per_core : 1 > > cpu_mhz : 2400 > > hw_caps : > > bfebfbff:20100800:00000000:00000940:0004e3bd:00000000:00000001:00000000 > > virt_caps : hvm > > total_memory : 65532 > > free_memory : 40317 > > node_to_cpu : node0:0-15 > > node_to_memory : node0:40317 > > node_to_dma32_mem : node0:3256 > > max_node_id : 0 > > xen_major : 4 > > xen_minor : 0 > > xen_extra : .1 > > xen_caps : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 > > hvm-3.0-x86_32p hvm-3.0-x86_64 > > xen_scheduler : credit > > xen_pagesize : 4096 > > platform_params : virt_start=0xffff800000000000 > > xen_changeset : unavailable > > xen_commandline : placeholder dom0_mem=3072M loglvl=warning > > guest_loglvl=warning > > cc_compiler : gcc version 4.4.5 (Debian 4.4.5-8) > > cc_compile_by : ultrotter > > cc_compile_domain : debian.org > > cc_compile_date : Sat Sep 8 19:15:46 UTC 2012 > > xend_config_format : 4 > > > > I'm experiencing weekly a clock jump ahead of about 50 minutes on dom0. > > I'm seriously in trouble because it cause every time a reboot of one > > of the two nodes clusters. > --e89a8f64347c3ab36704cab59f37 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hello,=A0

From my point of view, this was a kind of xen = hardware "incompatibility/bug" : I was able to reproduce this bug= on more than 50 identical servers, but not on another farm of servers with= a different hardware.
Xen version, Debian Kernel was exactly the same on both farm.

Regards

Olivier

2012/9/27 Dan Magenheimer <dan.magenheime= r@oracle.com>
> From: Mauro [mailto:mrsanna1@gmail.com]
> Sent: Thursday, September 27, 2012 9:55 AM
> To: Olivier Hanesse
> Cc: Dan Magenheimer; Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Keir Fraser; Jan Be= ulich;
> Keir Fraser; Xen Users; Mark Adams
> Subject: Re: [Xen-users] Re: [Xen-devel] Xen 4 TSC problems
>
> On 28 February 2011 16:54, Olivier Hanesse <olivier.hanesse@gmail.com> wrote:
> > Yes this is what I mean.
> > I am glad to hear that it isn't a bad sign :)
> > I thought of a bad sign, because on system with "reliable TS= C", this counter
> > is always 0.
>
> Hey men.
> I have exactly the same problem.
> I have two cluster nodes.
> Server are two HP Proliant DL 580 G4 with four Quad Core Intel(R)
> Xeon(R) CPU E7330 =A0@ 2.40GHz.
> I'm running debian squeeze in dom0s end domUs.

Hi Mauro --

There's been a lot of work on clocks since 4.0 (by other Xen developers= ,
not me). =A0I don't think this specific problem was ever reproduced
by a developer so I don't think anyone knows if it has been
already fixed or not, nor are there any plans to backport all the
timer work to 4.0.

You might try upgrading your Xen hypervisor to the just-released
Xen 4.2 [1] and see if the problem goes away. =A0If the problem still
exists in 4.2, it may be easier to get some developer to pay attention
to it. =A0It may be specific hardware or processors or power
management or firmware or even dom0 kernel, so the first thing
to do is try later hypervisor bits.

Sorry I can't be more helpful. =A0Good luck!

Dan

[1] Sorry, I'm not familiar with the 4.0->4.2 upgrade process
so you may want to confirm with others.

> xm info:
>
> host =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 : xen-p01
> release =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0: 2.6.32-5-xen-amd64
> version =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0: #1 SMP Sun May 6 08:57:29 UTC= 2012
> machine =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0: x86_64
> nr_cpus =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0: 16
> nr_nodes =A0 =A0 =A0 =A0 =A0 =A0 =A0 : 1
> cores_per_socket =A0 =A0 =A0 : 4
> threads_per_core =A0 =A0 =A0 : 1
> cpu_mhz =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0: 2400
> hw_caps =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0:
> bfebfbff:20100800:00000000:00000940:0004e3bd:00000000:00000001:0000000= 0
> virt_caps =A0 =A0 =A0 =A0 =A0 =A0 =A0: hvm
> total_memory =A0 =A0 =A0 =A0 =A0 : 65532
> free_memory =A0 =A0 =A0 =A0 =A0 =A0: 40317
> node_to_cpu =A0 =A0 =A0 =A0 =A0 =A0: node0:0-15
> node_to_memory =A0 =A0 =A0 =A0 : node0:40317
> node_to_dma32_mem =A0 =A0 =A0: node0:3256
> max_node_id =A0 =A0 =A0 =A0 =A0 =A0: 0
> xen_major =A0 =A0 =A0 =A0 =A0 =A0 =A0: 4
> xen_minor =A0 =A0 =A0 =A0 =A0 =A0 =A0: 0
> xen_extra =A0 =A0 =A0 =A0 =A0 =A0 =A0: .1
> xen_caps =A0 =A0 =A0 =A0 =A0 =A0 =A0 : xen-3.0-x86_64 xen-3.0-x86_32p = hvm-3.0-x86_32
> hvm-3.0-x86_32p hvm-3.0-x86_64
> xen_scheduler =A0 =A0 =A0 =A0 =A0: credit
> xen_pagesize =A0 =A0 =A0 =A0 =A0 : 4096
> platform_params =A0 =A0 =A0 =A0: virt_start=3D0xffff800000000000
> xen_changeset =A0 =A0 =A0 =A0 =A0: unavailable
> xen_commandline =A0 =A0 =A0 =A0: placeholder dom0_mem=3D3072M loglvl= =3Dwarning
> guest_loglvl=3Dwarning
> cc_compiler =A0 =A0 =A0 =A0 =A0 =A0: gcc version 4.4.5 (Debian 4.4.5-8= )
> cc_compile_by =A0 =A0 =A0 =A0 =A0: ultrotter
> cc_compile_domain =A0 =A0 =A0: debian.org
> cc_compile_date =A0 =A0 =A0 =A0: Sat Sep =A08 19:15:46 UTC 2012
> xend_config_format =A0 =A0 : 4
>
> I'm experiencing weekly a clock jump ahead of about 50 minutes on = dom0.
> I'm seriously in trouble because it cause every time a reboot of o= ne
> of the two nodes clusters.

--e89a8f64347c3ab36704cab59f37-- --===============1496806874817463489== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel --===============1496806874817463489==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Thu, 27 Sep 2012 23:42:04 +0200 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <65395f62-74e5-4910-b701-8df629c2ce3b@default> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 27 September 2012 23:28, Olivier Hanesse wrote: > Hello, > > From my point of view, this was a kind of xen hardware "incompatibility/bug" > : I was able to reproduce this bug on more than 50 identical servers, but > not on another farm of servers with a different hardware. > Xen version, Debian Kernel was exactly the same on both farm. Yes I think so. The problem is where I use debian squeeze with xen 4.0. In another server with the same hardware but with debian lenny and xen 3.0 I have no problems. I've read that a workaround is to set clocksource=pit on the xen boot line in the grub conf, I hope this works because I can't change hardware. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Sat, 29 Sep 2012 10:08:45 +0200 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <65395f62-74e5-4910-b701-8df629c2ce3b@default> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0135414908100069518==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============0135414908100069518== Content-Type: multipart/alternative; boundary=e89a8f23585f6dc74e04cad2ae5f --e89a8f23585f6dc74e04cad2ae5f Content-Type: text/plain; charset=ISO-8859-1 It didn't work for me :( clocksource=pit made another "time jump" (don't remember how much, but it was worst than 50min) 2012/9/27 Mauro > On 27 September 2012 23:28, Olivier Hanesse > wrote: > > Hello, > > > > From my point of view, this was a kind of xen hardware > "incompatibility/bug" > > : I was able to reproduce this bug on more than 50 identical servers, but > > not on another farm of servers with a different hardware. > > Xen version, Debian Kernel was exactly the same on both farm. > > Yes I think so. > The problem is where I use debian squeeze with xen 4.0. > In another server with the same hardware but with debian lenny and xen > 3.0 I have no problems. > I've read that a workaround is to set clocksource=pit on the xen boot > line in the grub conf, I hope this works because I can't change hardware. > --e89a8f23585f6dc74e04cad2ae5f Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable It didn't work for me :(
clocksource=3Dpit made another "time = jump" (don't remember how much, but it was worst than 50min)
2012/9/27 Mauro <mrsanna1@gmail.com>=
On 27 September 2012 23:28= , Olivier Hanesse <olivier.= hanesse@gmail.com> wrote:
> Hello,
>
> From my point of view, this was a kind of xen hardware "incompati= bility/bug"
> : I was able to reproduce this bug on more than 50 identical servers, = but
> not on another farm of servers with a different hardware.
> Xen version, Debian Kernel was exactly the same on both farm.

Yes I think so.
The problem is where I use debian squeeze with xen 4.0.
In another server with the same hardware but with debian lenny and xen
3.0 I have no problems.
I've read that a workaround is to set clocksource=3Dpit on the xen boot=
line in the grub conf, I hope this works because I can't change h= ardware.

--e89a8f23585f6dc74e04cad2ae5f-- --===============0135414908100069518== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel --===============0135414908100069518==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Sat, 29 Sep 2012 11:41:39 +0200 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <65395f62-74e5-4910-b701-8df629c2ce3b@default> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 29 September 2012 10:08, Olivier Hanesse wrote: > It didn't work for me :( > clocksource=pit made another "time jump" (don't remember how much, but it > was worst than 50min) Damn.........so there isn't a solution, it is a huge problem. What processors do you have? I have HP Proliant DL580 G5 systems with four quad core Intel(R) Xeon(R) CPU E7330 @ 2.40GHz and debian linux as s.o. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Sat, 29 Sep 2012 14:19:55 +0200 Message-ID: References: <68c41dd8-9195-41b0-83d7-9242b8eff809@default> <65395f62-74e5-4910-b701-8df629c2ce3b@default> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Jan Beulich , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org It's happened another time, system date 50 minutes ahead. There is really no solution? root@xen-p02:~# date sab 29 set 2012, 15.06.25, CEST root@xen-p02:~# hwclock --debug hwclock from util-linux-ng 2.17.2 Using /dev interface to clock. Last drift adjustment done at 1348816781 seconds after 1969 Last calibration done at 1348816781 seconds after 1969 Hardware clock is on UTC time Assuming hardware clock is kept in UTC time. Waiting for clock tick... ...got clock tick Time read from Hardware Clock: 2012/09/29 12:16:58 Hw clock time : 2012/09/29 12:16:58 = 1348921018 seconds since 1969 sab 29 set 2012 14:16:58 CEST -0.751536 seconds root@xen-p02:~# hwclock --show sab 29 set 2012 14:17:12 CEST -0.423643 seconds From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Xen 4 TSC problems Date: Sat, 29 Sep 2012 17:13:08 +0200 Message-ID: References: <4D655A39.2040501@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Keir Fraser Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Jeremy Fitzhardinge , Olivier Hanesse , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org T24gMjQgRmVicnVhcnkgMjAxMSAwODoxNiwgS2VpciBGcmFzZXIgPGtlaXIueGVuQGdtYWlsLmNv bT4gd3JvdGU6Cj4gUGxlYXNlIHNlbmQgWGVuIGJvb3Qgb3V0cHV0ICh4bSBkbWVzZykuIEdldHRp bmcgaXQgZnJvbSBYZW4gMy4yIGFzIHdlbGwKPiB3b3VsZCBiZSBpbnRlcmVzdGluZywgaWYgeW91 IHN0aWxsIGhhdmUgaXQgaW5zdGFsbGVkIG9uIGFueSBvZiB0aGVzZQo+IG1hY2hpbmVzLgoKSWYg aXQgY2FuIGJlIHVzZWZ1bCBoZXJlIGlzIHhtIGRtZXNmIG9mIHhlbiA0LjAgb24gYSBkZWJpYW4g c3F1ZWV6ZSBzeXN0ZW06CgooWEVOKSBYZW4gdmVyc2lvbiA0LjAuMSAoRGViaWFuIDQuMC4xLTUu NCkgKHVsdHJvdHRlckBkZWJpYW4ub3JnKSAoZ2NjCnZlcnNpb24gNC40LjUgKERlYmlhbiA0LjQu NS04KSApIFNhdCBTZXAgIDggMTk6MTU6NDYgVVRDIDIwMTIKKFhFTikgQm9vdGxvYWRlcjogR1JV QiAxLjk4KzIwMTAwODA0LTE0K3NxdWVlemUxCihYRU4pIENvbW1hbmQgbGluZTogcGxhY2Vob2xk ZXIgZG9tMF9tZW09MzA3Mk0gbG9nbHZsPXdhcm5pbmcKZ3Vlc3RfbG9nbHZsPXdhcm5pbmcKKFhF TikgVmlkZW8gaW5mb3JtYXRpb246CihYRU4pICBWR0EgaXMgdGV4dCBtb2RlIDgweDI1LCBmb250 IDh4MTYKKFhFTikgIFZCRS9EREMgbWV0aG9kczogVjI7IEVESUQgdHJhbnNmZXIgdGltZTogMiBz ZWNvbmRzCihYRU4pIERpc2MgaW5mb3JtYXRpb246CihYRU4pICBGb3VuZCAyIE1CUiBzaWduYXR1 cmVzCihYRU4pICBGb3VuZCAyIEVERCBpbmZvcm1hdGlvbiBzdHJ1Y3R1cmVzCihYRU4pIFhlbi1l ODIwIFJBTSBtYXA6CihYRU4pICAwMDAwMDAwMDAwMDAwMDAwIC0gMDAwMDAwMDAwMDA5ZjQwMCAo dXNhYmxlKQooWEVOKSAgMDAwMDAwMDAwMDA5ZjQwMCAtIDAwMDAwMDAwMDAwYTAwMDAgKHJlc2Vy dmVkKQooWEVOKSAgMDAwMDAwMDAwMDBmMDAwMCAtIDAwMDAwMDAwMDAxMDAwMDAgKHJlc2VydmVk KQooWEVOKSAgMDAwMDAwMDAwMDEwMDAwMCAtIDAwMDAwMDAwY2ZkNDMwMDAgKHVzYWJsZSkKKFhF TikgIDAwMDAwMDAwY2ZkNDMwMDAgLSAwMDAwMDAwMGNmZDRjMDAwIChBQ1BJIGRhdGEpCihYRU4p ICAwMDAwMDAwMGNmZDRjMDAwIC0gMDAwMDAwMDBjZmQ0ZDAwMCAodXNhYmxlKQooWEVOKSAgMDAw MDAwMDBjZmQ0ZDAwMCAtIDAwMDAwMDAwZDAwMDAwMDAgKHJlc2VydmVkKQooWEVOKSAgMDAwMDAw MDBlMDAwMDAwMCAtIDAwMDAwMDAwZjAwMDAwMDAgKHJlc2VydmVkKQooWEVOKSAgMDAwMDAwMDBm ZWMwMDAwMCAtIDAwMDAwMDAwZmVkMDAwMDAgKHJlc2VydmVkKQooWEVOKSAgMDAwMDAwMDBmZWUw MDAwMCAtIDAwMDAwMDAwZmVlMTAwMDAgKHJlc2VydmVkKQooWEVOKSAgMDAwMDAwMDBmZmMwMDAw MCAtIDAwMDAwMDAxMDAwMDAwMDAgKHJlc2VydmVkKQooWEVOKSAgMDAwMDAwMDEwMDAwMDAwMCAt IDAwMDAwMDEwMmZmZmYwMDAgKHVzYWJsZSkKKFhFTikgQUNQSTogUlNEUCAwMDBGNEYyMCwgMDAy NCAocjIgSFAgICAgKQooWEVOKSBBQ1BJOiBYU0RUIENGRDQzOTAwLCAwMDdDIChyMSBIUCAgICAg UHJvTGlhbnQgICAgICAgIDIgICDvv70gICAgIDE2MkUpCihYRU4pIEFDUEk6IEZBQ1AgQ0ZENDM5 QzAsIDAwRjQgKHIzIEhQICAgICBQcm9MaWFudCAgICAgICAgMiAgIO+/vSAgICAgMTYyRSkKKFhF TikgQUNQSTogRFNEVCBDRkQ0M0FDMCwgMzBDOSAocjEgSFAgICAgICAgICBEU0RUICAgICAgICAx IElOVEwgMjAwMzAyMjgpCihYRU4pIEFDUEk6IEZBQ1MgQ0ZENDMxMDAsIDAwNDAKKFhFTikgQUNQ STogU1BDUiBDRkQ0MzE0MCwgMDA1MCAocjEgSFAgICAgIFNQQ1JSQlNVICAgICAgICAxICAg77+9 ICAgICAxNjJFKQooWEVOKSBBQ1BJOiBNQ0ZHIENGRDQzMUMwLCAwMDNDIChyMSBIUCAgICAgUHJv TGlhbnQgICAgICAgIDEgICAgICAgICAgICAgMCkKKFhFTikgQUNQSTogSFBFVCBDRkQ0MzIwMCwg MDAzOCAocjEgSFAgICAgIFByb0xpYW50ICAgICAgICAyICAg77+9ICAgICAxNjJFKQooWEVOKSBB Q1BJOiBGRkZGIENGRDQzMjQwLCAwMDY0IChyMiBIUCAgICAgUDYxICAgICAgICAgICAgIDIgICDv v70gICAgIDE2MkUpCihYRU4pIEFDUEk6IFNQTUkgQ0ZENDMyQzAsIDAwNDAgKHI1IEhQICAgICBQ cm9MaWFudCAgICAgICAgMSAgIO+/vSAgICAgMTYyRSkKKFhFTikgQUNQSTogRVJTVCBDRkQ0MzMw MCwgMDFEMCAocjEgSFAgICAgIFByb0xpYW50ICAgICAgICAxICAg77+9ICAgICAxNjJFKQooWEVO KSBBQ1BJOiBBUElDIENGRDQzNTAwLCAwMTc2IChyMSBIUCAgICAgUHJvTGlhbnQgICAgICAgIDIg ICAgICAgICAgICAgMCkKKFhFTikgQUNQSTogRkZGRiBDRkQ0MzY4MCwgMDE3NiAocjEgSFAgICAg IFByb0xpYW50ICAgICAgICAxICAg77+9ICAgICAxNjJFKQooWEVOKSBBQ1BJOiBCRVJUIENGRDQz ODAwLCAwMDMwIChyMSBIUCAgICAgUHJvTGlhbnQgICAgICAgIDEgICDvv70gICAgIDE2MkUpCihY RU4pIEFDUEk6IEhFU1QgQ0ZENDM4NDAsIDAwQkMgKHIxIEhQICAgICBQcm9MaWFudCAgICAgICAg MSAgIO+/vSAgICAgMTYyRSkKKFhFTikgU3lzdGVtIFJBTTogNjU1MzJNQiAoNjcxMDU2NzJrQikK KFhFTikgRG9tYWluIGhlYXAgaW5pdGlhbGlzZWQKKFhFTikgUHJvY2Vzc29yICMwIDY6MTUgQVBJ QyB2ZXJzaW9uIDIwCihYRU4pIFByb2Nlc3NvciAjOCA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVO KSBQcm9jZXNzb3IgIzE2IDY6MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4pIFByb2Nlc3NvciAjMjQg NjoxNSBBUElDIHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMxIDY6MTUgQVBJQyB2ZXJzaW9u IDIwCihYRU4pIFByb2Nlc3NvciAjOSA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNz b3IgIzE3IDY6MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4pIFByb2Nlc3NvciAjMjUgNjoxNSBBUElD IHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMyIDY6MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4p IFByb2Nlc3NvciAjMTAgNjoxNSBBUElDIHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMxOCA2 OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNzb3IgIzI2IDY6MTUgQVBJQyB2ZXJzaW9u IDIwCihYRU4pIFByb2Nlc3NvciAjMyA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNz b3IgIzExIDY6MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4pIFByb2Nlc3NvciAjMTkgNjoxNSBBUElD IHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMyNyA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVO KSBJT0FQSUNbMF06IGFwaWNfaWQgMSwgdmVyc2lvbiAzMiwgYWRkcmVzcyAweGZlYzAwMDAwLCBH U0kgMC0yMwooWEVOKSBJT0FQSUNbMV06IGFwaWNfaWQgMiwgdmVyc2lvbiAzMiwgYWRkcmVzcyAw eGZlYzgwMDAwLCBHU0kgMjQtNDcKKFhFTikgSU9BUElDWzJdOiBhcGljX2lkIDMsIHZlcnNpb24g MzIsIGFkZHJlc3MgMHhmZWM4MTAwMCwgR1NJIDQ4LTcxCihYRU4pIElPQVBJQ1szXTogYXBpY19p ZCA0LCB2ZXJzaW9uIDMyLCBhZGRyZXNzIDB4ZmVjODE4MDAsIEdTSSA3Mi05NQooWEVOKSBFbmFi bGluZyBBUElDIG1vZGU6ICBQaHlzLiAgVXNpbmcgNCBJL08gQVBJQ3MKKFhFTikgVXNpbmcgc2No ZWR1bGVyOiBTTVAgQ3JlZGl0IFNjaGVkdWxlciAoY3JlZGl0KQooWEVOKSBEZXRlY3RlZCAyNDAw LjE0NSBNSHogcHJvY2Vzc29yLgooWEVOKSBJbml0aW5nIG1lbW9yeSBzaGFyaW5nLgooWEVOKSBW TVg6IFN1cHBvcnRlZCBhZHZhbmNlZCBmZWF0dXJlczoKKFhFTikgIC0gQVBJQyBNTUlPIGFjY2Vz cyB2aXJ0dWFsaXNhdGlvbgooWEVOKSAgLSBBUElDIFRQUiBzaGFkb3cKKFhFTikgIC0gVmlydHVh bCBOTUkKKFhFTikgIC0gTVNSIGRpcmVjdC1hY2Nlc3MgYml0bWFwCihYRU4pIEhWTTogQVNJRHMg ZGlzYWJsZWQuCihYRU4pIEhWTTogVk1YIGVuYWJsZWQKKFhFTikgSS9PIHZpcnR1YWxpc2F0aW9u IGRpc2FibGVkCihYRU4pIFRvdGFsIG9mIDE2IHByb2Nlc3NvcnMgYWN0aXZhdGVkLgooWEVOKSBF TkFCTElORyBJTy1BUElDIElSUXMKKFhFTikgIC0+IFVzaW5nIG5ldyBBQ0sgbWV0aG9kCihYRU4p IGNoZWNraW5nIFRTQyBzeW5jaHJvbml6YXRpb24gYWNyb3NzIDE2IENQVXM6IHBhc3NlZC4KKFhF TikgUGxhdGZvcm0gdGltZXIgaXMgMTQuMzE4TUh6IEhQRVQKKFhFTikgQWxsb2NhdGVkIGNvbnNv bGUgcmluZyBvZiAzMiBLaUIuCihYRU4pIEJyb3VnaHQgdXAgMTYgQ1BVcwooWEVOKSAqKiogTE9B RElORyBET01BSU4gMCAqKioKKFhFTikgIFhlbiAga2VybmVsOiA2NC1iaXQsIGxzYiwgY29tcGF0 MzIKKFhFTikgIERvbTAga2VybmVsOiA2NC1iaXQsIFBBRSwgbHNiLCBwYWRkciAweDEwMDAwMDAg LT4gMHgxNzA4MDAwCihYRU4pIFBIWVNJQ0FMIE1FTU9SWSBBUlJBTkdFTUVOVDoKKFhFTikgIERv bTAgYWxsb2MuOiAgIDAwMDAwMDA4M2MwMDAwMDAtPjAwMDAwMDA4NDAwMDAwMDAgKDc3MDA0OCBw YWdlcwp0byBiZSBhbGxvY2F0ZWQpCihYRU4pIFZJUlRVQUwgTUVNT1JZIEFSUkFOR0VNRU5UOgoo WEVOKSAgTG9hZGVkIGtlcm5lbDogZmZmZmZmZmY4MTAwMDAwMC0+ZmZmZmZmZmY4MTcwODAwMAoo WEVOKSAgSW5pdC4gcmFtZGlzazogZmZmZmZmZmY4MTcwODAwMC0+ZmZmZmZmZmY4MWVmYjAwMAoo WEVOKSAgUGh5cy1NYWNoIG1hcDogZmZmZmZmZmY4MWVmYjAwMC0+ZmZmZmZmZmY4MjRmYjAwMAoo WEVOKSAgU3RhcnQgaW5mbzogICAgZmZmZmZmZmY4MjRmYjAwMC0+ZmZmZmZmZmY4MjRmYjRiNAoo WEVOKSAgUGFnZSB0YWJsZXM6ICAgZmZmZmZmZmY4MjRmYzAwMC0+ZmZmZmZmZmY4MjUxMzAwMAoo WEVOKSAgQm9vdCBzdGFjazogICAgZmZmZmZmZmY4MjUxMzAwMC0+ZmZmZmZmZmY4MjUxNDAwMAoo WEVOKSAgVE9UQUw6ICAgICAgICAgZmZmZmZmZmY4MDAwMDAwMC0+ZmZmZmZmZmY4MjgwMDAwMAoo WEVOKSAgRU5UUlkgQUREUkVTUzogZmZmZmZmZmY4MTUzMTIwMAooWEVOKSBEb20wIGhhcyBtYXhp bXVtIDE2IFZDUFVzCihYRU4pIFNjcnViYmluZyBGcmVlIFJBTToKLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4u Li4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi4uLi5kb25lLgooWEVOKSBYZW4gdHJhY2UgYnVmZmVy czogZGlzYWJsZWQKKFhFTikgU3RkLiBMb2dsZXZlbDogRXJyb3JzIGFuZCB3YXJuaW5ncwooWEVO KSBHdWVzdCBMb2dsZXZlbDogRXJyb3JzIGFuZCB3YXJuaW5ncwooWEVOKSBYZW4gaXMgcmVsaW5x dWlzaGluZyBWR0EgY29uc29sZS4KKFhFTikgKioqIFNlcmlhbCBpbnB1dCAtPiBET00wICh0eXBl ICdDVFJMLWEnIHRocmVlIHRpbWVzIHRvIHN3aXRjaAppbnB1dCB0byBYZW4pCihYRU4pIEZyZWVk IDE3NmtCIGluaXQgbWVtb3J5LgooWEVOKSBQbGF0Zm9ybSB0aW1lciBhcHBlYXJzIHRvIGhhdmUg dW5leHBlY3RlZGx5IHdyYXBwZWQgMTAgb3IgbW9yZSB0aW1lcy4KCgphbmQgaGVyZSBpcyB4bSBk bWVzZyBvZiB4ZW4gMy4yIG9uIGEgZGViaWFuIGxlbm55IHN5c3RlbSBydW5uaW5nIG9uCnRoZSBz YW1lIGhhcmR3YXJlLCBvbiB0aGlzIHN5c3RlbSBJIGRvbid0IGhhdmUgY2xvY2sgcHJvYmxlbXM6 CgooWEVOKSBYZW4gdmVyc2lvbiAzLjItMSAoRGViaWFuIDMuMi4xLTIpICh3YWxkaUBkZWJpYW4u b3JnKSAoZ2NjCnZlcnNpb24gNC4zLjEgKERlYmlhbiA0LjMuMS0yKSApIFNhdCBKdW4gMjggMDk6 MzI6MTggVVRDIDIwMDgKKFhFTikgQ29tbWFuZCBsaW5lOgooWEVOKSBWaWRlbyBpbmZvcm1hdGlv bjoKKFhFTikgIFZHQSBpcyB0ZXh0IG1vZGUgODB4MjUsIGZvbnQgOHgxNgooWEVOKSAgVkJFL0RE QyBtZXRob2RzOiBWMjsgRURJRCB0cmFuc2ZlciB0aW1lOiAyIHNlY29uZHMKKFhFTikgRGlzYyBp bmZvcm1hdGlvbjoKKFhFTikgIEZvdW5kIDIgTUJSIHNpZ25hdHVyZXMKKFhFTikgIEZvdW5kIDIg RUREIGluZm9ybWF0aW9uIHN0cnVjdHVyZXMKKFhFTikgWGVuLWU4MjAgUkFNIG1hcDoKKFhFTikg IDAwMDAwMDAwMDAwMDAwMDAgLSAwMDAwMDAwMDAwMDlmNDAwICh1c2FibGUpCihYRU4pICAwMDAw MDAwMDAwMDlmNDAwIC0gMDAwMDAwMDAwMDBhMDAwMCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAw MDAwMGYwMDAwIC0gMDAwMDAwMDAwMDEwMDAwMCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAwMDAw MTAwMDAwIC0gMDAwMDAwMDBjZmQ0MzAwMCAodXNhYmxlKQooWEVOKSAgMDAwMDAwMDBjZmQ0MzAw MCAtIDAwMDAwMDAwY2ZkNGMwMDAgKEFDUEkgZGF0YSkKKFhFTikgIDAwMDAwMDAwY2ZkNGMwMDAg LSAwMDAwMDAwMGNmZDRkMDAwICh1c2FibGUpCihYRU4pICAwMDAwMDAwMGNmZDRkMDAwIC0gMDAw MDAwMDBkMDAwMDAwMCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAwMGUwMDAwMDAwIC0gMDAwMDAw MDBmMDAwMDAwMCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAwMGZlYzAwMDAwIC0gMDAwMDAwMDBm ZWQwMDAwMCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAwMGZlZTAwMDAwIC0gMDAwMDAwMDBmZWUx MDAwMCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAwMGZmYzAwMDAwIC0gMDAwMDAwMDEwMDAwMDAw MCAocmVzZXJ2ZWQpCihYRU4pICAwMDAwMDAwMTAwMDAwMDAwIC0gMDAwMDAwMGYyZmZmZjAwMCAo dXNhYmxlKQooWEVOKSBTeXN0ZW0gUkFNOiA2MTQzNk1CICg2MjkxMTM2OGtCKQooWEVOKSBYZW4g aGVhcDogMTJNQiAoMTMxMjhrQikKKFhFTikgRG9tYWluIGhlYXAgaW5pdGlhbGlzZWQ6IERNQSB3 aWR0aCAzMiBiaXRzCihYRU4pIFByb2Nlc3NvciAjMCA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVO KSBQcm9jZXNzb3IgIzggNjoxNSBBUElDIHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMxNiA2 OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNzb3IgIzI0IDY6MTUgQVBJQyB2ZXJzaW9u IDIwCihYRU4pIFByb2Nlc3NvciAjMSA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNz b3IgIzkgNjoxNSBBUElDIHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMxNyA2OjE1IEFQSUMg dmVyc2lvbiAyMAooWEVOKSBQcm9jZXNzb3IgIzI1IDY6MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4p IFByb2Nlc3NvciAjMiA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNzb3IgIzEwIDY6 MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4pIFByb2Nlc3NvciAjMTggNjoxNSBBUElDIHZlcnNpb24g MjAKKFhFTikgUHJvY2Vzc29yICMyNiA2OjE1IEFQSUMgdmVyc2lvbiAyMAooWEVOKSBQcm9jZXNz b3IgIzMgNjoxNSBBUElDIHZlcnNpb24gMjAKKFhFTikgUHJvY2Vzc29yICMxMSA2OjE1IEFQSUMg dmVyc2lvbiAyMAooWEVOKSBQcm9jZXNzb3IgIzE5IDY6MTUgQVBJQyB2ZXJzaW9uIDIwCihYRU4p IFByb2Nlc3NvciAjMjcgNjoxNSBBUElDIHZlcnNpb24gMjAKKFhFTikgSU9BUElDWzBdOiBhcGlj X2lkIDEsIHZlcnNpb24gMzIsIGFkZHJlc3MgMHhmZWMwMDAwMCwgR1NJIDAtMjMKKFhFTikgSU9B UElDWzFdOiBhcGljX2lkIDIsIHZlcnNpb24gMzIsIGFkZHJlc3MgMHhmZWM4MDAwMCwgR1NJIDI0 LTQ3CihYRU4pIElPQVBJQ1syXTogYXBpY19pZCAzLCB2ZXJzaW9uIDMyLCBhZGRyZXNzIDB4ZmVj ODEwMDAsIEdTSSA0OC03MQooWEVOKSBJT0FQSUNbM106IGFwaWNfaWQgNCwgdmVyc2lvbiAzMiwg YWRkcmVzcyAweGZlYzgxODAwLCBHU0kgNzItOTUKKFhFTikgRW5hYmxpbmcgQVBJQyBtb2RlOiAg UGh5cy4gIFVzaW5nIDQgSS9PIEFQSUNzCihYRU4pIFVzaW5nIHNjaGVkdWxlcjogU01QIENyZWRp dCBTY2hlZHVsZXIgKGNyZWRpdCkKKFhFTikgRGV0ZWN0ZWQgMjQwMC4xMzYgTUh6IHByb2Nlc3Nv ci4KKFhFTikgSFZNOiBWTVggZW5hYmxlZAooWEVOKSBDUFUwOiBJbnRlbChSKSBYZW9uKFIpIENQ VSAgICAgICAgICAgRTczMzAgIEAgMi40MEdIeiBzdGVwcGluZyAwYgooWEVOKSBCb290aW5nIHBy b2Nlc3NvciAxLzggZWlwIDhjMDAwCihYRU4pIENQVTE6IEludGVsKFIpIFhlb24oUikgQ1BVICAg ICAgICAgICBFNzMzMCAgQCAyLjQwR0h6IHN0ZXBwaW5nIDBiCihYRU4pIEJvb3RpbmcgcHJvY2Vz c29yIDIvMTYgZWlwIDhjMDAwCihYRU4pIENQVTI6IEludGVsKFIpIFhlb24oUikgQ1BVICAgICAg ICAgICBFNzMzMCAgQCAyLjQwR0h6IHN0ZXBwaW5nIDBiCihYRU4pIEJvb3RpbmcgcHJvY2Vzc29y IDMvMjQgZWlwIDhjMDAwCihYRU4pIENQVTM6IEludGVsKFIpIFhlb24oUikgQ1BVICAgICAgICAg ICBFNzMzMCAgQCAyLjQwR0h6IHN0ZXBwaW5nIDBiCihYRU4pIEJvb3RpbmcgcHJvY2Vzc29yIDQv MSBlaXAgOGMwMDAKKFhFTikgQ1BVNDogSW50ZWwoUikgWGVvbihSKSBDUFUgICAgICAgICAgIEU3 MzMwICBAIDIuNDBHSHogc3RlcHBpbmcgMGIKKFhFTikgQm9vdGluZyBwcm9jZXNzb3IgNS85IGVp cCA4YzAwMAooWEVOKSBDUFU1OiBJbnRlbChSKSBYZW9uKFIpIENQVSAgICAgICAgICAgRTczMzAg IEAgMi40MEdIeiBzdGVwcGluZyAwYgooWEVOKSBCb290aW5nIHByb2Nlc3NvciA2LzE3IGVpcCA4 YzAwMAooWEVOKSBDUFU2OiBJbnRlbChSKSBYZW9uKFIpIENQVSAgICAgICAgICAgRTczMzAgIEAg Mi40MEdIeiBzdGVwcGluZyAwYgooWEVOKSBCb290aW5nIHByb2Nlc3NvciA3LzI1IGVpcCA4YzAw MAooWEVOKSBDUFU3OiBJbnRlbChSKSBYZW9uKFIpIENQVSAgICAgICAgICAgRTczMzAgIEAgMi40 MEdIeiBzdGVwcGluZyAwYgooWEVOKSBCb290aW5nIHByb2Nlc3NvciA4LzIgZWlwIDhjMDAwCihY RU4pIENQVTg6IEludGVsKFIpIFhlb24oUikgQ1BVICAgICAgICAgICBFNzMzMCAgQCAyLjQwR0h6 IHN0ZXBwaW5nIDBiCihYRU4pIEJvb3RpbmcgcHJvY2Vzc29yIDkvMTAgZWlwIDhjMDAwCihYRU4p IENQVTk6IEludGVsKFIpIFhlb24oUikgQ1BVICAgICAgICAgICBFNzMzMCAgQCAyLjQwR0h6IHN0 ZXBwaW5nIDBiCihYRU4pIEJvb3RpbmcgcHJvY2Vzc29yIDEwLzE4IGVpcCA4YzAwMAooWEVOKSBD UFUxMDogSW50ZWwoUikgWGVvbihSKSBDUFUgICAgICAgICAgIEU3MzMwICBAIDIuNDBHSHogc3Rl cHBpbmcgMGIKKFhFTikgQm9vdGluZyBwcm9jZXNzb3IgMTEvMjYgZWlwIDhjMDAwCihYRU4pIENQ VTExOiBJbnRlbChSKSBYZW9uKFIpIENQVSAgICAgICAgICAgRTczMzAgIEAgMi40MEdIeiBzdGVw cGluZyAwYgooWEVOKSBCb290aW5nIHByb2Nlc3NvciAxMi8zIGVpcCA4YzAwMAooWEVOKSBDUFUx MjogSW50ZWwoUikgWGVvbihSKSBDUFUgICAgICAgICAgIEU3MzMwICBAIDIuNDBHSHogc3RlcHBp bmcgMGIKKFhFTikgQm9vdGluZyBwcm9jZXNzb3IgMTMvMTEgZWlwIDhjMDAwCihYRU4pIENQVTEz OiBJbnRlbChSKSBYZW9uKFIpIENQVSAgICAgICAgICAgRTczMzAgIEAgMi40MEdIeiBzdGVwcGlu ZyAwYgooWEVOKSBCb290aW5nIHByb2Nlc3NvciAxNC8xOSBlaXAgOGMwMDAKKFhFTikgQ1BVMTQ6 IEludGVsKFIpIFhlb24oUikgQ1BVICAgICAgICAgICBFNzMzMCAgQCAyLjQwR0h6IHN0ZXBwaW5n IDBiCihYRU4pIEJvb3RpbmcgcHJvY2Vzc29yIDE1LzI3IGVpcCA4YzAwMAooWEVOKSBDUFUxNTog SW50ZWwoUikgWGVvbihSKSBDUFUgICAgICAgICAgIEU3MzMwICBAIDIuNDBHSHogc3RlcHBpbmcg MGIKKFhFTikgVG90YWwgb2YgMTYgcHJvY2Vzc29ycyBhY3RpdmF0ZWQuCihYRU4pIEVOQUJMSU5H IElPLUFQSUMgSVJRcwooWEVOKSAgLT4gVXNpbmcgbmV3IEFDSyBtZXRob2QKKFhFTikgUGxhdGZv cm0gdGltZXIgb3ZlcmZsb3dzIGluIDE0OTk4IGppZmZpZXMuCihYRU4pIFBsYXRmb3JtIHRpbWVy IGlzIDE0LjMxOE1IeiBIUEVUCihYRU4pIEJyb3VnaHQgdXAgMTYgQ1BVcwooWEVOKSBBTUQgSU9N TVU6IERpc2FibGVkCihYRU4pICoqKiBMT0FESU5HIERPTUFJTiAwICoqKgooWEVOKSAgWGVuICBr ZXJuZWw6IDY0LWJpdCwgbHNiLCBjb21wYXQzMgooWEVOKSAgRG9tMCBrZXJuZWw6IDY0LWJpdCwg bHNiLCBwYWRkciAweDIwMDAwMCAtPiAweDYzMTkxOAooWEVOKSBQSFlTSUNBTCBNRU1PUlkgQVJS QU5HRU1FTlQ6CihYRU4pICBEb20wIGFsbG9jLjogICAwMDAwMDAwOGUwMDAwMDAwLT4wMDAwMDAw OGYwMDAwMDAwICgxNTQyMjU4NQpwYWdlcyB0byBiZSBhbGxvY2F0ZWQpCihYRU4pIFZJUlRVQUwg TUVNT1JZIEFSUkFOR0VNRU5UOgooWEVOKSAgTG9hZGVkIGtlcm5lbDogZmZmZmZmZmY4MDIwMDAw MC0+ZmZmZmZmZmY4MDYzMTkxOAooWEVOKSAgSW5pdC4gcmFtZGlzazogZmZmZmZmZmY4MDYzMjAw MC0+ZmZmZmZmZmY4MGQ0MGUwMAooWEVOKSAgUGh5cy1NYWNoIG1hcDogZmZmZmZmZmY4MGQ0MTAw MC0+ZmZmZmZmZmY4ODM2YjNjOAooWEVOKSAgU3RhcnQgaW5mbzogICAgZmZmZmZmZmY4ODM2YzAw MC0+ZmZmZmZmZmY4ODM2YzRhNAooWEVOKSAgUGFnZSB0YWJsZXM6ICAgZmZmZmZmZmY4ODM2ZDAw MC0+ZmZmZmZmZmY4ODNiNDAwMAooWEVOKSAgQm9vdCBzdGFjazogICAgZmZmZmZmZmY4ODNiNDAw MC0+ZmZmZmZmZmY4ODNiNTAwMAooWEVOKSAgVE9UQUw6ICAgICAgICAgZmZmZmZmZmY4MDAwMDAw MC0+ZmZmZmZmZmY4ODgwMDAwMAooWEVOKSAgRU5UUlkgQUREUkVTUzogZmZmZmZmZmY4MDIwMDAw MAooWEVOKSBEb20wIGhhcyBtYXhpbXVtIDE2IFZDUFVzCihYRU4pIEluaXRyZCBsZW4gMHg3MGVl MDAsIHN0YXJ0IGF0IDB4ZmZmZmZmZmY4MDYzMjAwMAooWEVOKSBTY3J1YmJpbmcgRnJlZSBSQU06 IC5kb25lLgooWEVOKSBYZW4gdHJhY2UgYnVmZmVyczogZGlzYWJsZWQKKFhFTikgU3RkLiBMb2ds ZXZlbDogRXJyb3JzIGFuZCB3YXJuaW5ncwooWEVOKSBHdWVzdCBMb2dsZXZlbDogTm90aGluZyAo UmF0ZS1saW1pdGVkOiBFcnJvcnMgYW5kIHdhcm5pbmdzKQooWEVOKSBYZW4gaXMgcmVsaW5xdWlz aGluZyBWR0EgY29uc29sZS4KKFhFTikgKioqIFNlcmlhbCBpbnB1dCAtPiBET00wICh0eXBlICdD VFJMLWEnIHRocmVlIHRpbWVzIHRvIHN3aXRjaAppbnB1dCB0byBYZW4pCihYRU4pIEZyZWVkIDEw NGtCIGluaXQgbWVtb3J5LgoKX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fX18KWGVuLXVzZXJzIG1haWxpbmcgbGlzdApYZW4tdXNlcnNAbGlzdHMueGVuLm9yZwpo dHRwOi8vbGlzdHMueGVuLm9yZy94ZW4tdXNlcnM= From mboxrd@z Thu Jan 1 00:00:00 1970 From: Pasi =?iso-8859-1?Q?K=E4rkk=E4inen?= Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Sun, 30 Sep 2012 18:13:14 +0300 Message-ID: <20120930151314.GU8912@reaktio.net> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Content-Disposition: inline In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On Sat, Sep 29, 2012 at 02:19:55PM +0200, Mauro wrote: > It's happened another time, system date 50 minutes ahead. > There is really no solution? > Try with a recent Xen hypervisor version. Xen 4.1.3 or 4.2.0. It helps a lot to know if the issue is still in the latest hypervisor versions or not. 4.0.1 is quite old already.. and besides 4.0.4 is the latest version in 4.0 branch. -- Pasi > root@xen-p02:~# date > sab 29 set 2012, 15.06.25, CEST > > root@xen-p02:~# hwclock --debug > hwclock from util-linux-ng 2.17.2 > Using /dev interface to clock. > Last drift adjustment done at 1348816781 seconds after 1969 > Last calibration done at 1348816781 seconds after 1969 > Hardware clock is on UTC time > Assuming hardware clock is kept in UTC time. > Waiting for clock tick... > ...got clock tick > Time read from Hardware Clock: 2012/09/29 12:16:58 > Hw clock time : 2012/09/29 12:16:58 = 1348921018 seconds since 1969 > sab 29 set 2012 14:16:58 CEST -0.751536 seconds > > root@xen-p02:~# hwclock --show > sab 29 set 2012 14:17:12 CEST -0.423643 seconds > > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xen.org > http://lists.xen.org/xen-devel From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Sun, 30 Sep 2012 21:23:44 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Return-path: In-Reply-To: <20120930151314.GU8912@reaktio.net> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: =?UTF-8?B?UGFzaSBLw6Rya2vDpGluZW4=?= Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org T24gMzAgU2VwdGVtYmVyIDIwMTIgMTc6MTMsIFBhc2kgS8Okcmtrw6RpbmVuIDxwYXNpa0Bpa2ku Zmk+IHdyb3RlOgo+IE9uIFNhdCwgU2VwIDI5LCAyMDEyIGF0IDAyOjE5OjU1UE0gKzAyMDAsIE1h dXJvIHdyb3RlOgo+PiBJdCdzIGhhcHBlbmVkIGFub3RoZXIgdGltZSwgc3lzdGVtIGRhdGUgNTAg bWludXRlcyBhaGVhZC4KPj4gVGhlcmUgaXMgcmVhbGx5IG5vIHNvbHV0aW9uPwo+Pgo+Cj4gVHJ5 IHdpdGggYSByZWNlbnQgWGVuIGh5cGVydmlzb3IgdmVyc2lvbi4gWGVuIDQuMS4zIG9yIDQuMi4w Lgo+IEl0IGhlbHBzIGEgbG90IHRvIGtub3cgaWYgdGhlIGlzc3VlIGlzIHN0aWxsIGluIHRoZSBs YXRlc3QgaHlwZXJ2aXNvciB2ZXJzaW9ucyBvciBub3QuCgpJJ20gdXNpbmcgZGViaWFuIHNxdWVl emUgeGVuIGtlcm5lbCBhbmQgdGhpcyBrZXJuZWwgaGFzIHhlbiA0LjAuCgpfX19fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fXwpYZW4tdXNlcnMgbWFpbGluZyBsaXN0 Clhlbi11c2Vyc0BsaXN0cy54ZW4ub3JnCmh0dHA6Ly9saXN0cy54ZW4ub3JnL3hlbi11c2Vycw== From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Sun, 30 Sep 2012 22:19:22 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: =?UTF-8?B?UGFzaSBLw6Rya2vDpGluZW4=?= Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org T24gMzAgU2VwdGVtYmVyIDIwMTIgMjE6MjMsIE1hdXJvIDxtcnNhbm5hMUBnbWFpbC5jb20+IHdy b3RlOgo+IE9uIDMwIFNlcHRlbWJlciAyMDEyIDE3OjEzLCBQYXNpIEvDpHJra8OkaW5lbiA8cGFz aWtAaWtpLmZpPiB3cm90ZToKPj4gT24gU2F0LCBTZXAgMjksIDIwMTIgYXQgMDI6MTk6NTVQTSAr MDIwMCwgTWF1cm8gd3JvdGU6Cj4+PiBJdCdzIGhhcHBlbmVkIGFub3RoZXIgdGltZSwgc3lzdGVt IGRhdGUgNTAgbWludXRlcyBhaGVhZC4KPj4+IFRoZXJlIGlzIHJlYWxseSBubyBzb2x1dGlvbj8K Pj4+Cj4+Cj4+IFRyeSB3aXRoIGEgcmVjZW50IFhlbiBoeXBlcnZpc29yIHZlcnNpb24uIFhlbiA0 LjEuMyBvciA0LjIuMC4KPj4gSXQgaGVscHMgYSBsb3QgdG8ga25vdyBpZiB0aGUgaXNzdWUgaXMg c3RpbGwgaW4gdGhlIGxhdGVzdCBoeXBlcnZpc29yIHZlcnNpb25zIG9yIG5vdC4KClRoZXJlIGlz IHNvbWVvbmUgdGhhdCBoYWQgdGhlIHByb2JsZW0gYW5kIHNvbHZlZCB1c2luZyBhIHJlY2VudCB4 ZW4gaHlwZXJ2aXNvcj8KCl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX19fClhlbi11c2VycyBtYWlsaW5nIGxpc3QKWGVuLXVzZXJzQGxpc3RzLnhlbi5vcmcKaHR0 cDovL2xpc3RzLnhlbi5vcmcveGVuLXVzZXJz From mboxrd@z Thu Jan 1 00:00:00 1970 From: Zary Matej Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 1 Oct 2012 13:39:11 +0200 Message-ID: <5DB0519124BB3D4DBEEB14426D4AC7EA6A72C9D58E@aries.space.cvtisr.sk> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net>, Mime-Version: 1.0 Content-Type: text/plain; charset="iso-8859-2" Content-Transfer-Encoding: quoted-printable Return-path: In-Reply-To: Content-Language: en-GB List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro , =?iso-8859-2?Q?Pasi_K=E4rkk=E4inen?= Cc: Dan Magenheimer , "xen-devel@lists.xensource.com" , Keir Fraser , Jeremy Fitzhardinge , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >From: xen-users-bounces@lists.xen.org [xen-users-bounces@lists.xen.org] On= Behalf Of Mauro [mrsanna1@gmail.com] >On 30 September 2012 17:13, Pasi K=E4rkk=E4inen wrote: >> On Sat, Sep 29, 2012 at 02:19:55PM +0200, Mauro wrote: >>> It's happened another time, system date 50 minutes ahead. >>> There is really no solution? >>> >> >> Try with a recent Xen hypervisor version. Xen 4.1.3 or 4.2.0. >> It helps a lot to know if the issue is still in the latest hypervisor ve= rsions or not. > >I'm using debian squeeze xen kernel and this kernel has xen 4.0. > Xen and kernel are two different things, you can mix and match them in many= ways. If you don't want to compile from source, use Xen packages from Debi= an Wheezy (4.1.3-2), Sid (4.1.3-3) or Experimental (4.2.0-1). I doubt you w= ill move forward without trying newer versions no matter how many mails you= write to list (all the people) ... :) Matej From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olivier Hanesse Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 09:39:56 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============3773458365543498546==" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org --===============3773458365543498546== Content-Type: multipart/alternative; boundary=f46d0446317ce0d64a04cc1424ae --f46d0446317ce0d64a04cc1424ae Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hello, Strange things on this rainy morning, I got this bug again on another hardware, and Xen hypervisor from Wheezy and kernel from Squeeze. ii linux-image-2.6.32-5-xen-amd64 2.6.32-46 Linux 2.6.32 for 64-bit PCs, Xen dom0 support ii xen-hypervisor-4.1-amd64 4.1.3-2 Xen Hypervisor on AMD64 I've upgraded this server last week. I don't know if it is linked or not, but I didn't get any 'time wrap' on this server for more than 250days. Maybe it is related to the upgrade process. Before the upgrade, my version were : ii linux-image-2.6.32-5-xen-amd64 2.6.32-41squeeze2 Linux 2.6.32 for 64-bit PCs, Xen dom0 support ii xen-hypervisor-4.1-amd64 4.1.2-2 Xen Hypervisor on AMD64 I only upgraded half of my servers, so I will wait a little bit to upgrade the other half and see if this issue occurs again only on updated servers. For the record, always the same errors : xm dmesg =3D> (XEN) Platform timer appears to have unexpectedly wrapped 10 = or more times. /var/log/* =3D> Oct 14 22:46:07 eul2400468 kernel: [734618.562219] Clocksource tsc unstable (delta =3D -2999660313370 ns) I thought it was this issue was hardware related, maybe not. Olivier 2012/9/30 Mauro > On 30 September 2012 21:23, Mauro wrote: > > On 30 September 2012 17:13, Pasi K=E4rkk=E4inen wrote: > >> On Sat, Sep 29, 2012 at 02:19:55PM +0200, Mauro wrote: > >>> It's happened another time, system date 50 minutes ahead. > >>> There is really no solution? > >>> > >> > >> Try with a recent Xen hypervisor version. Xen 4.1.3 or 4.2.0. > >> It helps a lot to know if the issue is still in the latest hypervisor > versions or not. > > There is someone that had the problem and solved using a recent xen > hypervisor? > --f46d0446317ce0d64a04cc1424ae Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable Hello,=A0

Strange things on this rainy morning, I got th= is bug again on another hardware, and Xen hypervisor from Wheezy and kernel= from Squeeze.

ii =A0linux-image-2.6.32-5-xen= -amd64 =A0 =A0 =A02.6.32-46 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0Linux 2.6.32= for 64-bit PCs, Xen dom0 support
ii =A0xen-hypervisor-4.1-amd64 =A0 =A0 =A0 =A0 =A0 =A0 =A0 4.1.3-2 =A0= =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0Xen Hypervisor on AMD64
<= br>
I've upgraded this server last week. I don't know if = it is linked or not, but I didn't get any 'time wrap' on this s= erver for more than 250days.
Maybe it is related to the upgrade process.=A0Before the upgrade, my v= ersion were :

ii =A0linux-image-2.6.32-5-xen-= amd64 =A0 2.6.32-41squeeze2 =A0 =A0 =A0 =A0 =A0 =A0Linux 2.6.32 for 64-bit = PCs, Xen dom0 support
ii =A0xen-hypervisor-4.1-amd64 =A0 =A0 =A0 =A0 =A0 =A04.1.2-2 =A0 =A0 = =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0 =A0Xen Hypervisor on AMD64
=
I only upgraded half of my servers, so I will wait a little = bit to upgrade the other half and see if this issue occurs again only on up= dated servers.

For the record, always the same errors :=A0
<= br>
xm dmesg =3D>=A0(XEN) Platform timer appears to have unexp= ectedly wrapped 10 or more times.
=A0
/var/log/* = =A0 =3D> Oct 14 22:46:07 eul2400468 kernel: [734618.562219] Clocksource = tsc unstable (delta =3D -2999660313370 ns)

I thought it was this issue was hardware related,= maybe not.

Olivier

2012/9/30 Mauro <mrsanna1@gmail.com>
On 30 September 2012 21:23= , Mauro <mrsanna1@gmail.com>= ; wrote:
> On 30 September 2012 17:13, Pasi K=E4rkk=E4inen <pasik@iki.fi> wrote:
>> On Sat, Sep 29, 2012 at 02:19:55PM +0200, Mauro wrote:
>>> It's happened another time, system date 50 minutes ahead.<= br> >>> There is really no solution?
>>>
>>
>> Try with a recent Xen hypervisor version. Xen 4.1.3 or 4.2.0.
>> It helps a lot to know if the issue is still in the latest hypervi= sor versions or not.

There is someone that had the problem and solved using a recent xen h= ypervisor?

--f46d0446317ce0d64a04cc1424ae-- --===============3773458365543498546== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel --===============3773458365543498546==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 08:05:43 +0000 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============2670730164236996525==" Return-path: In-Reply-To: Content-Language: en-US List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: olivier.hanesse@gmail.com, mrsanna1@gmail.com Cc: dan.magenheimer@oracle.com, xen-devel@lists.xensource.com, keir@xen.org, jeremy@goop.org, keir.xen@gmail.com, xen-users@lists.xensource.com, mark@campbell-lange.net List-Id: xen-devel@lists.xenproject.org --===============2670730164236996525== Content-Language: en-US Content-Type: multipart/alternative; boundary="_000_FF93AF260AC2BB499A119CC65B092CF73148901Bsg000713corproo_" --_000_FF93AF260AC2BB499A119CC65B092CF73148901Bsg000713corproo_ Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hi Oliver bad news, this means that xen 4.1xxx doesn't solve this issue ... on my side, on my hardware that produce this bug ,I never had the problem = whit this combination : (100% WHEEZY) ii xen-hypervisor-4.1-amd64 4.1.3-2 amd= 64 Xen Hypervisor on AMD64 ii linux-image-3.2.0-3-amd64 3.2.23-1 amd= 64 Linux 3.2 for 64-bit PCs (this was because I had a great hope that 4.1.xxx solved th= e problem ...) Philippe From: xen-devel-bounces@lists.xen.org [mailto:xen-devel-bounces@lists.xen.o= rg] On Behalf Of Olivier Hanesse Sent: Monday, October 15, 2012 9:40 AM To: Mauro Cc: Dan Magenheimer; xen-devel@lists.xensource.com; Keir Fraser; Jeremy Fit= zhardinge; Keir Fraser; Xen Users; Mark Adams Subject: Re: [Xen-devel] [Xen-users] Re: Xen 4 TSC problems Hello, Strange things on this rainy morning, I got this bug again on another hardw= are, and Xen hypervisor from Wheezy and kernel from Squeeze. ii linux-image-2.6.32-5-xen-amd64 2.6.32-46 Linux 2.= 6.32 for 64-bit PCs, Xen dom0 support ii xen-hypervisor-4.1-amd64 4.1.3-2 Xen= Hypervisor on AMD64 I've upgraded this server last week. I don't know if it is linked or not, b= ut I didn't get any 'time wrap' on this server for more than 250days. Maybe it is related to the upgrade process. Before the upgrade, my version = were : ii linux-image-2.6.32-5-xen-amd64 2.6.32-41squeeze2 Linux 2.6= .32 for 64-bit PCs, Xen dom0 support ii xen-hypervisor-4.1-amd64 4.1.2-2 Xen Hy= pervisor on AMD64 I only upgraded half of my servers, so I will wait a little bit to upgrade = the other half and see if this issue occurs again only on updated servers. For the record, always the same errors : xm dmesg =3D> (XEN) Platform timer appears to have unexpectedly wrapped 10 = or more times. /var/log/* =3D> Oct 14 22:46:07 eul2400468 kernel: [734618.562219] Clocks= ource tsc unstable (delta =3D -2999660313370 ns) I thought it was this issue was hardware related, maybe not. Olivier 2012/9/30 Mauro > On 30 September 2012 21:23, Mauro > wrote: > On 30 September 2012 17:13, Pasi K=E4rkk=E4inen > wrote: >> On Sat, Sep 29, 2012 at 02:19:55PM +0200, Mauro wrote: >>> It's happened another time, system date 50 minutes ahead. >>> There is really no solution? >>> >> >> Try with a recent Xen hypervisor version. Xen 4.1.3 or 4.2.0. >> It helps a lot to know if the issue is still in the latest hypervisor ve= rsions or not. There is someone that had the problem and solved using a recent xen hypervi= sor? --_000_FF93AF260AC2BB499A119CC65B092CF73148901Bsg000713corproo_ Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable

Hi Oliver<= o:p>

 = ;

bad news, = this means that xen 4.1xxx doesn’t solve this issue …

 = ;

on my side= , on my  hardware that produce this bug ,I never had the problem whit = this combination : (100% WHEEZY)

ii  x= en-hypervisor-4.1-amd64         &nb= sp;        4.1.3-2   = ;            &n= bsp;   amd64        Xen Hyperv= isor on AMD64

ii  l= inux-image-3.2.0-3-amd64        &nb= sp;        3.2.23-1   &nb= sp;            =   amd64        Linux 3.2 for 64-bit= PCs

 = ;

 &nbs= p;            &= nbsp; (this was because I had a great hope that 4.1.xxx solved the problem = …)

 = ;

Philippe

 = ;

 = ;

 = ;

From: xen-devel-bounces@lists.xen.org [mailto:xen-devel-bou= nces@lists.xen.org] On Behalf Of Olivier Hanesse
Sent: Monday, October 15, 2012 9:40 AM
To: Mauro
Cc: Dan Magenheimer; xen-devel@lists.xensource.com; Keir Fraser; Jer= emy Fitzhardinge; Keir Fraser; Xen Users; Mark Adams
Subject: Re: [Xen-devel] [Xen-users] Re: Xen 4 TSC problems

 

Hello, 

 

Strange things on this rainy morning, I got this bug= again on another hardware, and Xen hypervisor from Wheezy and kernel from = Squeeze.

 

ii  linux-image-2.6.32-5-xen-amd64    = ;  2.6.32-46                 &= nbsp;Linux 2.6.32 for 64-bit PCs, Xen dom0 support

ii  xen-hypervisor-4.1-amd64     &nbs= p;         4.1.3-2           &= nbsp;          Xen Hypervisor on AMD64<= /p>

 

I've upgraded this server last week. I don't know if= it is linked or not, but I didn't get any 'time wrap' on this server for m= ore than 250days.

Maybe it is related to the upgrade process. Bef= ore the upgrade, my version were :

 

ii  linux-image-2.6.32-5-xen-amd64   2.6.3= 2-41squeeze2            Linux 2.6.32 for 64-b= it PCs, Xen dom0 support

ii  xen-hypervisor-4.1-amd64     &nbs= p;      4.1.2-2             &n= bsp;        Xen Hypervisor on AMD64

 

I only upgraded half of my servers, so I will wait a= little bit to upgrade the other half and see if this issue occurs again on= ly on updated servers.

 

For the record, always the same errors : <= /o:p>

 

xm dmesg =3D> (XEN) Platform timer appears t= o have unexpectedly wrapped 10 or more times.

 

/var/log/*   =3D> Oct 14 22:46:07 eul2400468= kernel: [734618.562219] Clocksource tsc unstable (delta =3D -2999660313370= ns)

 

I thought it was this issue was hardware related, ma= ybe not.

 

Olivier

 

2012/9/30 Mauro <mrsanna1@gmail.com>

On 30 September 2012 = 21:23, Mauro <mrsanna1@gmail.com> wrote:
> On 30 September 2012 17:13, Pasi K=E4rkk=E4inen <
pasik@iki.fi> wrote:
>> On Sat, Sep 29, 2012 at 02:19:55PM +0200, Mauro wrote:
>>> It's happened another time, system date 50 minutes ahead.
>>> There is really no solution?
>>>
>>
>> Try with a recent Xen hypervisor version. Xen 4.1.3 or 4.2.0.
>> It helps a lot to know if the issue is still in the latest hypervi= sor versions or not.

There is someone that had the problem and solved usi= ng a recent xen hypervisor?

 

--_000_FF93AF260AC2BB499A119CC65B092CF73148901Bsg000713corproo_-- --===============2670730164236996525== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel --===============2670730164236996525==-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 11:39:23 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> Mime-Version: 1.0 Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: base64 Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Philippe.Simonet@swisscom.com Cc: dan.magenheimer@oracle.com, xen-devel@lists.xensource.com, keir@xen.org, jeremy@goop.org, olivier.hanesse@gmail.com, keir.xen@gmail.com, xen-users@lists.xensource.com, mark@campbell-lange.net List-Id: xen-devel@lists.xenproject.org cmVhbGx5IHJlYWxseSBiYWQgbmV3cywgSSBob3BlIHRoaXMgYW5ub3lpbmcgcHJvYmxlbSB3aWxs IGJlIHJlc29sdmVkIHZlcnkgc29vbi4KCgpPbiAxNSBPY3RvYmVyIDIwMTIgMTA6MDUsICA8UGhp bGlwcGUuU2ltb25ldEBzd2lzc2NvbS5jb20+IHdyb3RlOgo+IEhpIE9saXZlcgo+Cj4KPgo+IGJh ZCBuZXdzLCB0aGlzIG1lYW5zIHRoYXQgeGVuIDQuMXh4eCBkb2VzbuKAmXQgc29sdmUgdGhpcyBp c3N1ZSDigKYKPgo+Cj4KPiBvbiBteSBzaWRlLCBvbiBteSAgaGFyZHdhcmUgdGhhdCBwcm9kdWNl IHRoaXMgYnVnICxJIG5ldmVyIGhhZCB0aGUgcHJvYmxlbQo+IHdoaXQgdGhpcyBjb21iaW5hdGlv biA6ICgxMDAlIFdIRUVaWSkKPgo+IGlpICB4ZW4taHlwZXJ2aXNvci00LjEtYW1kNjQgICAgICAg ICAgICAgICAgICA0LjEuMy0yCj4gYW1kNjQgICAgICAgIFhlbiBIeXBlcnZpc29yIG9uIEFNRDY0 Cj4KPiBpaSAgbGludXgtaW1hZ2UtMy4yLjAtMy1hbWQ2NCAgICAgICAgICAgICAgICAgMy4yLjIz LTEKPiBhbWQ2NCAgICAgICAgTGludXggMy4yIGZvciA2NC1iaXQgUENzCj4KPgo+Cj4gICAgICAg ICAgICAgICAgICh0aGlzIHdhcyBiZWNhdXNlIEkgaGFkIGEgZ3JlYXQgaG9wZSB0aGF0IDQuMS54 eHggc29sdmVkIHRoZQo+IHByb2JsZW0g4oCmKQo+Cj4KPgo+IFBoaWxpcHBlCj4KPgo+Cj4KPgo+ Cj4KPiBGcm9tOiB4ZW4tZGV2ZWwtYm91bmNlc0BsaXN0cy54ZW4ub3JnCj4gW21haWx0bzp4ZW4t ZGV2ZWwtYm91bmNlc0BsaXN0cy54ZW4ub3JnXSBPbiBCZWhhbGYgT2YgT2xpdmllciBIYW5lc3Nl Cj4gU2VudDogTW9uZGF5LCBPY3RvYmVyIDE1LCAyMDEyIDk6NDAgQU0KPiBUbzogTWF1cm8KPiBD YzogRGFuIE1hZ2VuaGVpbWVyOyB4ZW4tZGV2ZWxAbGlzdHMueGVuc291cmNlLmNvbTsgS2VpciBG cmFzZXI7IEplcmVteQo+IEZpdHpoYXJkaW5nZTsgS2VpciBGcmFzZXI7IFhlbiBVc2VyczsgTWFy ayBBZGFtcwo+IFN1YmplY3Q6IFJlOiBbWGVuLWRldmVsXSBbWGVuLXVzZXJzXSBSZTogWGVuIDQg VFNDIHByb2JsZW1zCj4KPgo+Cj4gSGVsbG8sCj4KPgo+Cj4gU3RyYW5nZSB0aGluZ3Mgb24gdGhp cyByYWlueSBtb3JuaW5nLCBJIGdvdCB0aGlzIGJ1ZyBhZ2FpbiBvbiBhbm90aGVyCj4gaGFyZHdh cmUsIGFuZCBYZW4gaHlwZXJ2aXNvciBmcm9tIFdoZWV6eSBhbmQga2VybmVsIGZyb20gU3F1ZWV6 ZS4KPgo+Cj4KPiBpaSAgbGludXgtaW1hZ2UtMi42LjMyLTUteGVuLWFtZDY0ICAgICAgMi42LjMy LTQ2ICAgICAgICAgICAgICAgICAgTGludXgKPiAyLjYuMzIgZm9yIDY0LWJpdCBQQ3MsIFhlbiBk b20wIHN1cHBvcnQKPgo+IGlpICB4ZW4taHlwZXJ2aXNvci00LjEtYW1kNjQgICAgICAgICAgICAg ICA0LjEuMy0yICAgICAgICAgICAgICAgICAgICAgIFhlbgo+IEh5cGVydmlzb3Igb24gQU1ENjQK Pgo+Cj4KPiBJJ3ZlIHVwZ3JhZGVkIHRoaXMgc2VydmVyIGxhc3Qgd2Vlay4gSSBkb24ndCBrbm93 IGlmIGl0IGlzIGxpbmtlZCBvciBub3QsCj4gYnV0IEkgZGlkbid0IGdldCBhbnkgJ3RpbWUgd3Jh cCcgb24gdGhpcyBzZXJ2ZXIgZm9yIG1vcmUgdGhhbiAyNTBkYXlzLgo+Cj4gTWF5YmUgaXQgaXMg cmVsYXRlZCB0byB0aGUgdXBncmFkZSBwcm9jZXNzLiBCZWZvcmUgdGhlIHVwZ3JhZGUsIG15IHZl cnNpb24KPiB3ZXJlIDoKPgo+Cj4KPiBpaSAgbGludXgtaW1hZ2UtMi42LjMyLTUteGVuLWFtZDY0 ICAgMi42LjMyLTQxc3F1ZWV6ZTIgICAgICAgICAgICBMaW51eAo+IDIuNi4zMiBmb3IgNjQtYml0 IFBDcywgWGVuIGRvbTAgc3VwcG9ydAo+Cj4gaWkgIHhlbi1oeXBlcnZpc29yLTQuMS1hbWQ2NCAg ICAgICAgICAgIDQuMS4yLTIgICAgICAgICAgICAgICAgICAgICAgWGVuCj4gSHlwZXJ2aXNvciBv biBBTUQ2NAo+Cj4KPgo+IEkgb25seSB1cGdyYWRlZCBoYWxmIG9mIG15IHNlcnZlcnMsIHNvIEkg d2lsbCB3YWl0IGEgbGl0dGxlIGJpdCB0byB1cGdyYWRlCj4gdGhlIG90aGVyIGhhbGYgYW5kIHNl ZSBpZiB0aGlzIGlzc3VlIG9jY3VycyBhZ2FpbiBvbmx5IG9uIHVwZGF0ZWQgc2VydmVycy4KPgo+ Cj4KPiBGb3IgdGhlIHJlY29yZCwgYWx3YXlzIHRoZSBzYW1lIGVycm9ycyA6Cj4KPgo+Cj4geG0g ZG1lc2cgPT4gKFhFTikgUGxhdGZvcm0gdGltZXIgYXBwZWFycyB0byBoYXZlIHVuZXhwZWN0ZWRs eSB3cmFwcGVkIDEwIG9yCj4gbW9yZSB0aW1lcy4KPgo+Cj4KPiAvdmFyL2xvZy8qICAgPT4gT2N0 IDE0IDIyOjQ2OjA3IGV1bDI0MDA0Njgga2VybmVsOiBbNzM0NjE4LjU2MjIxOV0KPiBDbG9ja3Nv dXJjZSB0c2MgdW5zdGFibGUgKGRlbHRhID0gLTI5OTk2NjAzMTMzNzAgbnMpCj4KPgo+Cj4gSSB0 aG91Z2h0IGl0IHdhcyB0aGlzIGlzc3VlIHdhcyBoYXJkd2FyZSByZWxhdGVkLCBtYXliZSBub3Qu Cj4KPgo+Cj4gT2xpdmllcgo+Cj4KPgo+IDIwMTIvOS8zMCBNYXVybyA8bXJzYW5uYTFAZ21haWwu Y29tPgo+Cj4gT24gMzAgU2VwdGVtYmVyIDIwMTIgMjE6MjMsIE1hdXJvIDxtcnNhbm5hMUBnbWFp bC5jb20+IHdyb3RlOgo+PiBPbiAzMCBTZXB0ZW1iZXIgMjAxMiAxNzoxMywgUGFzaSBLw6Rya2vD pGluZW4gPHBhc2lrQGlraS5maT4gd3JvdGU6Cj4+PiBPbiBTYXQsIFNlcCAyOSwgMjAxMiBhdCAw MjoxOTo1NVBNICswMjAwLCBNYXVybyB3cm90ZToKPj4+PiBJdCdzIGhhcHBlbmVkIGFub3RoZXIg dGltZSwgc3lzdGVtIGRhdGUgNTAgbWludXRlcyBhaGVhZC4KPj4+PiBUaGVyZSBpcyByZWFsbHkg bm8gc29sdXRpb24/Cj4+Pj4KPj4+Cj4+PiBUcnkgd2l0aCBhIHJlY2VudCBYZW4gaHlwZXJ2aXNv ciB2ZXJzaW9uLiBYZW4gNC4xLjMgb3IgNC4yLjAuCj4+PiBJdCBoZWxwcyBhIGxvdCB0byBrbm93 IGlmIHRoZSBpc3N1ZSBpcyBzdGlsbCBpbiB0aGUgbGF0ZXN0IGh5cGVydmlzb3IKPj4+IHZlcnNp b25zIG9yIG5vdC4KPgo+IFRoZXJlIGlzIHNvbWVvbmUgdGhhdCBoYWQgdGhlIHByb2JsZW0gYW5k IHNvbHZlZCB1c2luZyBhIHJlY2VudCB4ZW4KPiBoeXBlcnZpc29yPwo+Cj4KCl9fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fClhlbi1kZXZlbCBtYWlsaW5nIGxp c3QKWGVuLWRldmVsQGxpc3RzLnhlbi5vcmcKaHR0cDovL2xpc3RzLnhlbi5vcmcveGVuLWRldmVs Cg== From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 11:32:06 +0100 Message-ID: <507C024602000078000A155C@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro , Olivier Hanesse Cc: Dan Magenheimer , xen-devel@lists.xensource.com, Keir Fraser , Jeremy Fitzhardinge , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 15.10.12 at 09:39, Olivier Hanesse wrote: > For the record, always the same errors : > > xm dmesg => (XEN) Platform timer appears to have unexpectedly wrapped 10 or > more times. This is what needs to be analyzed: For it to happen, timer softirqs must not occur for quite long a period of time, and it needs to be determined what that is. Since only very few people can actually see this problem, we depend on at least some data collection (including figuring out what hardware and/or software components are involved in surfacing the problem) being done by them. Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 13:24:32 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <507C024602000078000A155C@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 15 October 2012 12:32, Jan Beulich wrote: >>>> On 15.10.12 at 09:39, Olivier Hanesse wrote: >> For the record, always the same errors : >> >> xm dmesg => (XEN) Platform timer appears to have unexpectedly wrapped 10 or >> more times. > > This is what needs to be analyzed: For it to happen, timer softirqs > must not occur for quite long a period of time, and it needs to be > determined what that is. Since only very few people can actually > see this problem, we depend on at least some data collection > (including figuring out what hardware and/or software components > are involved in surfacing the problem) being done by them. I have the problem on this hardware type: Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. It seem that GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" put in in /etc/default/grup (I use linux debian) solves the problem for me. From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 13:49:14 +0100 Message-ID: <507C226A02000078000A1657@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 15.10.12 at 13:24, Mauro wrote: > I have the problem on this hardware type: > > Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. > It seem that > GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" > put in in /etc/default/grup (I use linux debian) > solves the problem for me. Did you check whether either or both options on their own also make the problem go away? Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 15 Oct 2012 16:25:27 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <507C226A02000078000A1657@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 15 October 2012 14:49, Jan Beulich wrote: >>>> On 15.10.12 at 13:24, Mauro wrote: >> I have the problem on this hardware type: >> >> Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. >> It seem that >> GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" >> put in in /etc/default/grup (I use linux debian) >> solves the problem for me. > > Did you check whether either or both options on their own also > make the problem go away? Only clocksource=pit does not solve the problem, I've not tried with only cpuidle=0, I will try soon. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Wed, 17 Oct 2012 17:15:40 +0100 Message-ID: References: Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="B_3433338947_42397714" Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro , Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Dan Magenheimer , Olivier Hanesse , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org > This message is in MIME format. Since your mail reader does not understand this format, some or all of this message may not be legible. --B_3433338947_42397714 Content-type: text/plain; charset="US-ASCII" Content-transfer-encoding: 7bit On 15/10/2012 15:25, "Mauro" wrote: > On 15 October 2012 14:49, Jan Beulich wrote: >>>>> On 15.10.12 at 13:24, Mauro wrote: >>> I have the problem on this hardware type: >>> >>> Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. >>> It seem that >>> GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" >>> put in in /etc/default/grup (I use linux debian) >>> solves the problem for me. >> >> Did you check whether either or both options on their own also >> make the problem go away? > > Only clocksource=pit does not solve the problem, I've not tried with > only cpuidle=0, I will try soon. The problem here is that the platform timer has *not* wrapped. In fact it is almost certainly correct, and it is the calculation of current-system-time extrapolated from local CPU's TSC that has gone haywire. The overflow-handling logic in plt_overflow() then propagates that incorrectness into plt_stamp64 (up to a maximum of 10 times wrapping the platform timer's counter). This means that platform time is incorrect (skips forward) and soon after will infect the local time estimation for all CPUs. I've attached a patch which will (a) stop plt_overflow() from misguidedly trying to fix up apparent platform timer overflow; and (b) will print possibly-useful diagnostics when apparent 'timer overflow' occurs. Such lines will be prefixed "XXX plt_overflow:" in the hypervisor log. Patch is against xen-unstable but I'm sure it must backport to older trees quite trivially. -- Keir --B_3433338947_42397714 Content-type: application/octet-stream; name="00-tsc-debug" Content-disposition: attachment; filename="00-tsc-debug" Content-transfer-encoding: base64 ZGlmZiAtciBjMWM1NDljNGZlOWUgeGVuL2FyY2gveDg2L3RpbWUuYwotLS0gYS94ZW4vYXJj aC94ODYvdGltZS5jCU1vbiBPY3QgMTUgMTY6NTE6NDQgMjAxMiArMDEwMAorKysgYi94ZW4v YXJjaC94ODYvdGltZS5jCVdlZCBPY3QgMTcgMTc6MTM6MjIgMjAxMiArMDEwMApAQCAtNTIz LDExICs1MjMsMTIgQEAgc3RhdGljIHNfdGltZV90IF9fcmVhZF9wbGF0Zm9ybV9zdGltZSh1 Ngogc3RhdGljIHZvaWQgcGx0X292ZXJmbG93KHZvaWQgKnVudXNlZCkKIHsKICAgICBpbnQg aTsKLSAgICB1NjQgY291bnQ7CisgICAgdTY0IGNvdW50LCBvbGRfc3RhbXAsIHRzYzsKICAg ICBzX3RpbWVfdCBub3csIHBsdF9ub3csIHBsdF93cmFwOwogCiAgICAgc3Bpbl9sb2NrX2ly cSgmcGxhdGZvcm1fdGltZXJfbG9jayk7CiAKKyAgICBvbGRfc3RhbXAgPSBwbHRfc3RhbXA7 CiAgICAgY291bnQgPSBwbHRfc3JjLnJlYWRfY291bnRlcigpOwogICAgIHBsdF9zdGFtcDY0 ICs9IChjb3VudCAtIHBsdF9zdGFtcCkgJiBwbHRfbWFzazsKICAgICBwbHRfc3RhbXAgPSBj b3VudDsKQEAgLTU0MCw2ICs1NDEsMTQgQEAgc3RhdGljIHZvaWQgcGx0X292ZXJmbG93KHZv aWQgKnVudXNlZCkKICAgICAgICAgcGx0X3dyYXAgPSBfX3JlYWRfcGxhdGZvcm1fc3RpbWUo cGx0X3N0YW1wNjQgKyBwbHRfbWFzayArIDEpOwogICAgICAgICBpZiAoIEFCUyhwbHRfd3Jh cCAtIG5vdykgPiBBQlMocGx0X25vdyAtIG5vdykgKQogICAgICAgICAgICAgYnJlYWs7Cisg ICAgICAgIHJkdHNjbGwodHNjKTsKKyAgICAgICAgcHJpbnRrKCJYWFggcGx0X292ZXJmbG93 OiBwbHRfbm93PSUiUFJJeDY0IiBwbHRfd3JhcD0lIlBSSXg2NAorICAgICAgICAgICAgICAg IiBub3c9JSJQUkl4NjQiIG9sZF9zdGFtcD0lIlBSSXg2NCIgbmV3X3N0YW1wPSUiUFJJeDY0 CisgICAgICAgICAgICAgICAiIHBsdF9zdGFtcDY0PSUiUFJJeDY0IiBwbHRfbWFzaz0lIlBS SXg2NAorICAgICAgICAgICAgICAgIiB0c2M9JSJQUkl4NjQiIHRzY19zdGFtcD0lIlBSSXg2 NCJcbiIsCisgICAgICAgICAgICAgICBwbHRfbm93LCBwbHRfd3JhcCwgbm93LCBvbGRfc3Rh bXAsIHBsdF9zdGFtcCwgcGx0X3N0YW1wNjQsCisgICAgICAgICAgICAgICBwbHRfbWFzaywg dHNjLCB0aGlzX2NwdShjcHVfdGltZSkubG9jYWxfdHNjX3N0YW1wKTsKKyAgICAgICAgYnJl YWs7CiAgICAgICAgIHBsdF9zdGFtcDY0ICs9IHBsdF9tYXNrICsgMTsKICAgICB9CiAgICAg aWYgKCBpICE9IDAgKQo= --B_3433338947_42397714 Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline _______________________________________________ Xen-devel mailing list Xen-devel@lists.xen.org http://lists.xen.org/xen-devel --B_3433338947_42397714-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ian Campbell Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 08:40:21 +0100 Message-ID: <1350546021.28188.20.camel@dagon.hellion.org.uk> References: Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Keir Fraser Cc: Jeremy Fitzhardinge , "xen-devel@lists.xensource.com" , Dan Magenheimer , Mauro , Olivier Hanesse , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: > @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) > plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); > if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) > break; > + rdtscll(tsc); > + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 > + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 > + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 > + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", > + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, > + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); > + break; Is the break here, making the following update to plt_stamp64 dead code deliberate? > plt_stamp64 += plt_mask + 1; > } > if ( i != 0 ) Ian. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 08:55:04 +0100 Message-ID: References: <1350546021.28188.20.camel@dagon.hellion.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <1350546021.28188.20.camel@dagon.hellion.org.uk> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Ian Campbell Cc: Jeremy Fitzhardinge , "xen-devel@lists.xensource.com" , Dan Magenheimer , Mauro , Olivier Hanesse , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 18/10/2012 08:40, "Ian Campbell" wrote: > On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: >> @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) >> plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); >> if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) >> break; >> + rdtscll(tsc); >> + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 >> + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 >> + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 >> + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", >> + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, >> + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); >> + break; > > Is the break here, making the following update to plt_stamp64 dead code > deliberate? Yes, it's a hack to disable the timer-has-apparently-wrapped workaround. -- Keir >> plt_stamp64 += plt_mask + 1; >> } >> if ( i != 0 ) > > Ian. > > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ian Campbell Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 09:33:52 +0100 Message-ID: <1350549232.28188.39.camel@dagon.hellion.org.uk> References: Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Keir Fraser Cc: Jeremy Fitzhardinge , "xen-devel@lists.xensource.com" , Dan Magenheimer , Mauro , Olivier Hanesse , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On Thu, 2012-10-18 at 08:55 +0100, Keir Fraser wrote: > On 18/10/2012 08:40, "Ian Campbell" wrote: > > > On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: > >> @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) > >> plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); > >> if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) > >> break; > >> + rdtscll(tsc); > >> + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 > >> + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 > >> + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 > >> + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", > >> + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, > >> + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); > >> + break; > > > > Is the break here, making the following update to plt_stamp64 dead code > > deliberate? > > Yes, it's a hack to disable the timer-has-apparently-wrapped workaround. OK, good. I wonder if this explains some of the issues which have been plaguing Debian Squeeze (4.0 based) for a while now. I'll see if I can get someone there to give it a go. Ian. > > -- Keir > > >> plt_stamp64 += plt_mask + 1; > >> } > >> if ( i != 0 ) > > > > Ian. > > > > > > From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 10:56:22 +0200 Message-ID: References: <1350549232.28188.39.camel@dagon.hellion.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <1350549232.28188.39.camel@dagon.hellion.org.uk> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Ian Campbell Cc: Jeremy Fitzhardinge , "xen-devel@lists.xensource.com" , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 18 October 2012 10:33, Ian Campbell wrote: > On Thu, 2012-10-18 at 08:55 +0100, Keir Fraser wrote: >> On 18/10/2012 08:40, "Ian Campbell" wrote: >> >> > On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: >> >> @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) >> >> plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); >> >> if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) >> >> break; >> >> + rdtscll(tsc); >> >> + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 >> >> + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 >> >> + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 >> >> + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", >> >> + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, >> >> + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); >> >> + break; >> > >> > Is the break here, making the following update to plt_stamp64 dead code >> > deliberate? >> >> Yes, it's a hack to disable the timer-has-apparently-wrapped workaround. > > OK, good. > > I wonder if this explains some of the issues which have been plaguing > Debian Squeeze (4.0 based) for a while now. I'll see if I can get > someone there to give it a go. If that patch works debian kernel maintainers can be advised if they can include that patch and release a new kernel working for squeeze. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Ian Campbell Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 10:36:30 +0100 Message-ID: <1350552990.2460.89.camel@zakaz.uk.xensource.com> References: <1350549232.28188.39.camel@dagon.hellion.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Jeremy Fitzhardinge , "xen-devel@lists.xensource.com" , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Jan Beulich , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On Thu, 2012-10-18 at 09:56 +0100, Mauro wrote: > On 18 October 2012 10:33, Ian Campbell wrote: > > On Thu, 2012-10-18 at 08:55 +0100, Keir Fraser wrote: > >> On 18/10/2012 08:40, "Ian Campbell" wrote: > >> > >> > On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: > >> >> @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) > >> >> plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); > >> >> if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) > >> >> break; > >> >> + rdtscll(tsc); > >> >> + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 > >> >> + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 > >> >> + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 > >> >> + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", > >> >> + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, > >> >> + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); > >> >> + break; > >> > > >> > Is the break here, making the following update to plt_stamp64 dead code > >> > deliberate? > >> > >> Yes, it's a hack to disable the timer-has-apparently-wrapped workaround. > > > > OK, good. > > > > I wonder if this explains some of the issues which have been plaguing > > Debian Squeeze (4.0 based) for a while now. I'll see if I can get > > someone there to give it a go. > > If that patch works debian kernel maintainers can be advised if they > can include that patch and release a new kernel working for squeeze. AFAIK this is a debug patch, not something we would deploy as is but it should give us the information required to work out a real fix. Ian From mboxrd@z Thu Jan 1 00:00:00 1970 From: Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 13:45:43 +0000 Message-ID: References: <1350546021.28188.20.camel@dagon.hellion.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <1350546021.28188.20.camel@dagon.hellion.org.uk> Content-Language: en-US List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Ian.Campbell@citrix.com, keir@xen.org Cc: jeremy@goop.org, xen-devel@lists.xensource.com, dan.magenheimer@oracle.com, mrsanna1@gmail.com, olivier.hanesse@gmail.com, JBeulich@suse.com, xen-users@lists.xensource.com, mark@campbell-lange.net List-Id: xen-devel@lists.xenproject.org in the meantime, it would be cool to have a kernel boot parameter that could disable this wrapping' correction' ? like > -----Original Message----- > From: xen-devel-bounces@lists.xen.org [mailto:xen-devel- > bounces@lists.xen.org] On Behalf Of Ian Campbell > Sent: Thursday, October 18, 2012 9:40 AM > To: Keir Fraser > Cc: Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Dan > Magenheimer; Mauro; Olivier Hanesse; Jan Beulich; Xen Users; Mark Adams > Subject: Re: [Xen-devel] [Xen-users] Re: Xen 4 TSC problems > > On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: > > @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) > > plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); > > if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) > > break; > > + rdtscll(tsc); > > + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 > > + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 > > + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 > > + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", > > + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, > > + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); > > + break; > > Is the break here, making the following update to plt_stamp64 dead code > deliberate? > > > plt_stamp64 += plt_mask + 1; > > } > > if ( i != 0 ) > > Ian. > > > > _______________________________________________ > Xen-devel mailing list > Xen-devel@lists.xen.org > http://lists.xen.org/xen-devel From mboxrd@z Thu Jan 1 00:00:00 1970 From: Keir Fraser Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Thu, 18 Oct 2012 17:43:26 +0100 Message-ID: References: Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Philippe.Simonet@swisscom.com, Ian.Campbell@citrix.com Cc: jeremy@goop.org, xen-devel@lists.xensource.com, dan.magenheimer@oracle.com, mrsanna1@gmail.com, olivier.hanesse@gmail.com, JBeulich@suse.com, xen-users@lists.xensource.com, mark@campbell-lange.net List-Id: xen-devel@lists.xenproject.org We have no idea yet whether this workaround even does any good. -- Keir On 18/10/2012 14:45, "Philippe.Simonet@swisscom.com" wrote: > in the meantime, it would be cool to have a kernel boot parameter that could > disable this wrapping' > correction' ? like > >> -----Original Message----- >> From: xen-devel-bounces@lists.xen.org [mailto:xen-devel- >> bounces@lists.xen.org] On Behalf Of Ian Campbell >> Sent: Thursday, October 18, 2012 9:40 AM >> To: Keir Fraser >> Cc: Jeremy Fitzhardinge; xen-devel@lists.xensource.com; Dan >> Magenheimer; Mauro; Olivier Hanesse; Jan Beulich; Xen Users; Mark Adams >> Subject: Re: [Xen-devel] [Xen-users] Re: Xen 4 TSC problems >> >> On Wed, 2012-10-17 at 17:15 +0100, Keir Fraser wrote: >>> @@ -540,6 +541,14 @@ static void plt_overflow(void *unused) >>> plt_wrap = __read_platform_stime(plt_stamp64 + plt_mask + 1); >>> if ( ABS(plt_wrap - now) > ABS(plt_now - now) ) >>> break; >>> + rdtscll(tsc); >>> + printk("XXX plt_overflow: plt_now=%"PRIx64" plt_wrap=%"PRIx64 >>> + " now=%"PRIx64" old_stamp=%"PRIx64" new_stamp=%"PRIx64 >>> + " plt_stamp64=%"PRIx64" plt_mask=%"PRIx64 >>> + " tsc=%"PRIx64" tsc_stamp=%"PRIx64"\n", >>> + plt_now, plt_wrap, now, old_stamp, plt_stamp, plt_stamp64, >>> + plt_mask, tsc, this_cpu(cpu_time).local_tsc_stamp); >>> + break; >> >> Is the break here, making the following update to plt_stamp64 dead code >> deliberate? >> >>> plt_stamp64 += plt_mask + 1; >>> } >>> if ( i != 0 ) >> >> Ian. >> >> >> >> _______________________________________________ >> Xen-devel mailing list >> Xen-devel@lists.xen.org >> http://lists.xen.org/xen-devel From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Sun, 21 Oct 2012 22:52:32 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <507C226A02000078000A1657@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 15 October 2012 14:49, Jan Beulich wrote: >>>> On 15.10.12 at 13:24, Mauro wrote: >> I have the problem on this hardware type: >> >> Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. >> It seem that >> GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" >> put in in /etc/default/grup (I use linux debian) >> solves the problem for me. > > Did you check whether either or both options on their own also > make the problem go away? It seems that with debian squeeze on my HP Proliant Dl 580 G5 servers is sufficient to use GRUB_CMDLINE_XEN="cpuidle=0". Is from about 20 days that I have no clock jumps. Before I had a clock jump every week. Hope this is the final workaround for me. From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 22 Oct 2012 07:54:49 +0100 Message-ID: <508509D902000078000A2DDA@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 21.10.12 at 22:52, Mauro wrote: > On 15 October 2012 14:49, Jan Beulich wrote: >>>>> On 15.10.12 at 13:24, Mauro wrote: >>> I have the problem on this hardware type: >>> >>> Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. >>> It seem that >>> GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" >>> put in in /etc/default/grup (I use linux debian) >>> solves the problem for me. >> >> Did you check whether either or both options on their own also >> make the problem go away? > > It seems that with debian squeeze on my HP Proliant Dl 580 G5 servers > is sufficient to use > GRUB_CMDLINE_XEN="cpuidle=0". > Is from about 20 days that I have no clock jumps. > Before I had a clock jump every week. > Hope this is the final workaround for me. So what's the contents of /proc/cpuinfo (any one CPU suffices) under a native recent kernel on that system? The most likely issue here is that we're mis-identifying the CPU as having an always running APIC timer (ARAT)... For a second, less intrusive try: Could you replace "cpuidle=0" with "max_cstate=1" (assuming the former didn't meanwhile turn out not to cure the problem)? If that works too (expected), try "max_cstate=2" followed eventually by "max_cstate=2 local_apic_timer_c2_ok". Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Mon, 22 Oct 2012 11:17:46 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <508509D902000078000A2DDA@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 22 October 2012 08:54, Jan Beulich wrote: >>>> On 21.10.12 at 22:52, Mauro wrote: >> On 15 October 2012 14:49, Jan Beulich wrote: >>>>>> On 15.10.12 at 13:24, Mauro wrote: >>>> I have the problem on this hardware type: >>>> >>>> Hp Proliant DL580 G5 with four Intel(R) Xeon(R) CPU E7330 @ 2.40GHz. >>>> It seem that >>>> GRUB_CMDLINE_XEN="clocksource=pit cpuidle=0" >>>> put in in /etc/default/grup (I use linux debian) >>>> solves the problem for me. >>> >>> Did you check whether either or both options on their own also >>> make the problem go away? >> >> It seems that with debian squeeze on my HP Proliant Dl 580 G5 servers >> is sufficient to use >> GRUB_CMDLINE_XEN="cpuidle=0". >> Is from about 20 days that I have no clock jumps. >> Before I had a clock jump every week. >> Hope this is the final workaround for me. > > So what's the contents of /proc/cpuinfo (any one CPU suffices) > under a native recent kernel on that system? The most likely > issue here is that we're mis-identifying the CPU as having an > always running APIC timer (ARAT)... uname -a Linux xen-p01 2.6.32-5-xen-amd64 #1 SMP Sun Sep 23 13:49:30 UTC 2012 x86_64 GNU/Linux cat /proc/cpuinfo processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 15 model name : Intel(R) Xeon(R) CPU E7330 @ 2.40GHz stepping : 11 cpu MHz : 2400.176 cache size : 3072 KB fpu : yes fpu_exception : yes cpuid level : 10 wp : yes flags : fpu de tsc msr pae mce cx8 apic sep mtrr mca cmov pat clflush acpi mmx fxsr sse sse2 ss ht syscall nx lm constant_tsc rep_good aperfmperf pni est ssse3 cx16 hypervisor lahf_lm bogomips : 4800.35 clflush size : 64 cache_alignment : 64 address sizes : 40 bits physical, 48 bits virtual power management: > > For a second, less intrusive try: Could you replace "cpuidle=0" > with "max_cstate=1" (assuming the former didn't meanwhile > turn out not to cure the problem)? If that works too (expected), > try "max_cstate=2" followed eventually by > "max_cstate=2 local_apic_timer_c2_ok". I'll try but to say that it works I've to wait at least two weeks. From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 22 Oct 2012 10:27:08 +0100 Message-ID: <50852D8C02000078000A2E48@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 22.10.12 at 11:17, Mauro wrote: > On 22 October 2012 08:54, Jan Beulich wrote: >>>>> On 21.10.12 at 22:52, Mauro wrote: >>> On 15 October 2012 14:49, Jan Beulich wrote: >> So what's the contents of /proc/cpuinfo (any one CPU suffices) >> under a native recent kernel on that system? The most likely >> issue here is that we're mis-identifying the CPU as having an >> always running APIC timer (ARAT)... > > uname -a > > Linux xen-p01 2.6.32-5-xen-amd64 #1 SMP Sun Sep 23 13:49:30 UTC 2012 > x86_64 GNU/Linux I had specifically asked to do this under a _native_ kernel. > cat /proc/cpuinfo > > processor : 0 > vendor_id : GenuineIntel > cpu family : 6 > model : 15 > model name : Intel(R) Xeon(R) CPU E7330 @ 2.40GHz > stepping : 11 > cpu MHz : 2400.176 > cache size : 3072 KB > fpu : yes > fpu_exception : yes > cpuid level : 10 > wp : yes > flags : fpu de tsc msr pae mce cx8 apic sep mtrr mca cmov > pat clflush acpi mmx fxsr sse sse2 ss ht syscall nx lm constant_tsc > rep_good aperfmperf pni est ssse3 cx16 hypervisor lahf_lm > bogomips : 4800.35 > clflush size : 64 > cache_alignment : 64 > address sizes : 40 bits physical, 48 bits virtual > power management: > >> >> For a second, less intrusive try: Could you replace "cpuidle=0" >> with "max_cstate=1" (assuming the former didn't meanwhile >> turn out not to cure the problem)? If that works too (expected), >> try "max_cstate=2" followed eventually by >> "max_cstate=2 local_apic_timer_c2_ok". > > I'll try but to say that it works I've to wait at least two weeks. I understand that this takes quite a bit of time. Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-devel] Re: Xen 4 TSC problems Date: Mon, 22 Oct 2012 12:40:16 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <50852D8C02000078000A2E48@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-users-bounces@lists.xen.org Errors-To: xen-users-bounces@lists.xen.org To: Jan Beulich Cc: Jeremy Fitzhardinge , xen-devel@lists.xensource.com, Keir Fraser , Dan Magenheimer , Olivier Hanesse , Keir Fraser , Xen Users , Mark Adams List-Id: xen-devel@lists.xenproject.org On 22 October 2012 11:27, Jan Beulich wrote: >>>> On 22.10.12 at 11:17, Mauro wrote: >> On 22 October 2012 08:54, Jan Beulich wrote: >>>>>> On 21.10.12 at 22:52, Mauro wrote: >>>> On 15 October 2012 14:49, Jan Beulich wrote: >>> So what's the contents of /proc/cpuinfo (any one CPU suffices) >>> under a native recent kernel on that system? The most likely >>> issue here is that we're mis-identifying the CPU as having an >>> always running APIC timer (ARAT)... >> >> uname -a >> >> Linux xen-p01 2.6.32-5-xen-amd64 #1 SMP Sun Sep 23 13:49:30 UTC 2012 >> x86_64 GNU/Linux > > I had specifically asked to do this under a _native_ kernel. sorry for my ignorance, what does it mean native_kernel. From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Mon, 22 Oct 2012 13:06:58 +0100 Message-ID: <5085530202000078000A2F12@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Jeremy Fitzhardinge , Keir Fraser , Dan Magenheimer , xen-devel , Olivier Hanesse , Mark Adams List-Id: xen-devel@lists.xenproject.org >>> On 22.10.12 at 12:40, Mauro wrote: > On 22 October 2012 11:27, Jan Beulich wrote: >>>>> On 22.10.12 at 11:17, Mauro wrote: >>> On 22 October 2012 08:54, Jan Beulich wrote: >>>>>>> On 21.10.12 at 22:52, Mauro wrote: >>>>> On 15 October 2012 14:49, Jan Beulich wrote: >>>> So what's the contents of /proc/cpuinfo (any one CPU suffices) >>>> under a native recent kernel on that system? The most likely >>>> issue here is that we're mis-identifying the CPU as having an >>>> always running APIC timer (ARAT)... >>> >>> uname -a >>> >>> Linux xen-p01 2.6.32-5-xen-amd64 #1 SMP Sun Sep 23 13:49:30 UTC 2012 >>> x86_64 GNU/Linux >> >> I had specifically asked to do this under a _native_ kernel. > > sorry for my ignorance, what does it mean native_kernel. A kernel run with no Xen underneath it. Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 08:58:04 +0100 Message-ID: <50866A2C02000078000A356D@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: xen-devel List-Id: xen-devel@lists.xenproject.org >>> On 23.10.12 at 09:18, Mauro wrote: >> A kernel run with no Xen underneath it. > > Here is: > > uname -a > Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 > x86_64 GNU/Linux I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as I had asked for). The thing is that the (unfortunately incomplete) hypervisor log you sent earlier leaves open whether we mis-detect ARAT on that system (the relevant HPET message is info level, but you had "loglvl=warning" in place), so we can't be certain of either fact (ARAT actually being reported by the CPU as well as whether HPET broadcast is getting set up). Irrespective of that it would also be useful to know whether the native kernel (and in particular its CPU idle management) work on that system, and which C-states it actually makes use of. So getting us the contents of the respective sysfs nodes would also be helpful for reference. Jan > cat /proc/cpuinfo > > processor : 0 > vendor_id : GenuineIntel > cpu family : 6 > model : 15 > model name : Intel(R) Xeon(R) CPU E7330 @ 2.40GHz > stepping : 11 > cpu MHz : 2399.822 > cache size : 3072 KB > physical id : 0 > siblings : 4 > core id : 0 > cpu cores : 4 > apicid : 0 > initial apicid : 0 > fpu : yes > fpu_exception : yes > cpuid level : 10 > wp : yes > flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge > mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe > syscall nx lm constant_tsc arch_perfmon pebs bts rep_good aperfmperf > pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm dca lahf_lm > tpr_shadow vnmi flexpriority > bogomips : 4799.64 > clflush size : 64 > cache_alignment : 64 > address sizes : 40 bits physical, 48 bits virtual > power management: From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 10:40:50 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <50866A2C02000078000A356D@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Jan Beulich Cc: xen-devel List-Id: xen-devel@lists.xenproject.org On 23 October 2012 09:58, Jan Beulich wrote: >>>> On 23.10.12 at 09:18, Mauro wrote: >>> A kernel run with no Xen underneath it. >> >> Here is: >> >> uname -a >> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 >> x86_64 GNU/Linux > > I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as > I had asked for). Sorry I'm using squeeze in production servers and I don't have a test machine on which install wheezy. There's nothing else I can do? As reported before with the workaround cpuidle=0 I don't have any problem now. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 09:19:53 +0200 Message-ID: References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <5085530202000078000A2F12@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Jan Beulich Cc: Jeremy Fitzhardinge , Keir Fraser , Dan Magenheimer , xen-devel , Olivier Hanesse , Mark Adams List-Id: xen-devel@lists.xenproject.org > A kernel run with no Xen underneath it. Here is: uname -a Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 x86_64 GNU/Linux cat /proc/cpuinfo processor : 0 vendor_id : GenuineIntel cpu family : 6 model : 15 model name : Intel(R) Xeon(R) CPU E7330 @ 2.40GHz stepping : 11 cpu MHz : 2399.822 cache size : 3072 KB physical id : 0 siblings : 4 core id : 0 cpu cores : 4 apicid : 0 initial apicid : 0 fpu : yes fpu_exception : yes cpuid level : 10 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm dca lahf_lm tpr_shadow vnmi flexpriority bogomips : 4799.64 clflush size : 64 cache_alignment : 64 address sizes : 40 bits physical, 48 bits virtual power management: From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 09:50:13 +0100 Message-ID: <5086766502000078000A35D4@nat28.tlf.novell.com> References: <65395f62-74e5-4910-b701-8df629c2ce3b@default> <20120930151314.GU8912@reaktio.net> <507C024602000078000A155C@nat28.tlf.novell.com> <507C226A02000078000A1657@nat28.tlf.novell.com> <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: xen-devel List-Id: xen-devel@lists.xenproject.org >>> On 23.10.12 at 10:40, Mauro wrote: > On 23 October 2012 09:58, Jan Beulich wrote: >>>>> On 23.10.12 at 09:18, Mauro wrote: >>>> A kernel run with no Xen underneath it. >>> >>> Here is: >>> >>> uname -a >>> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 >>> x86_64 GNU/Linux >> >> I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as >> I had asked for). > > Sorry I'm using squeeze in production servers and I don't have a test > machine on which install wheezy. And can't/don't want to install a self-built kernel? > There's nothing else I can do? I told you yesterday what less invasive command line options you could try. Plus in an earlier mail today I also asked for specific information on which C-states the native kernel uses. If all you've got is 2.6.32, obtaining the information there is better than nothing. Jan > As reported before with the workaround cpuidle=0 I don't have any problem > now. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Konrad Rzeszutek Wilk Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 07:50:02 -0400 Message-ID: <20121023115001.GA23471@localhost.localdomain> References: <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> <5086766502000078000A35D4@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Content-Disposition: inline In-Reply-To: <5086766502000078000A35D4@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Jan Beulich Cc: Mauro , xen-devel List-Id: xen-devel@lists.xenproject.org On Tue, Oct 23, 2012 at 09:50:13AM +0100, Jan Beulich wrote: > >>> On 23.10.12 at 10:40, Mauro wrote: > > On 23 October 2012 09:58, Jan Beulich wrote: > >>>>> On 23.10.12 at 09:18, Mauro wrote: > >>>> A kernel run with no Xen underneath it. > >>> > >>> Here is: > >>> > >>> uname -a > >>> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 > >>> x86_64 GNU/Linux > >> > >> I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as > >> I had asked for). > > > > Sorry I'm using squeeze in production servers and I don't have a test > > machine on which install wheezy. > > And can't/don't want to install a self-built kernel? You could also boot one of those Live Image kernels. Like an Fedora or Ubuntu and just capture this. That way you don't over-write anything. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 16:07:32 +0200 Message-ID: References: <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> <5086766502000078000A35D4@nat28.tlf.novell.com> <20121023115001.GA23471@localhost.localdomain> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <20121023115001.GA23471@localhost.localdomain> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Konrad Rzeszutek Wilk Cc: xen-devel List-Id: xen-devel@lists.xenproject.org On 23 October 2012 13:50, Konrad Rzeszutek Wilk wrote: > On Tue, Oct 23, 2012 at 09:50:13AM +0100, Jan Beulich wrote: >> >>> On 23.10.12 at 10:40, Mauro wrote: >> > On 23 October 2012 09:58, Jan Beulich wrote: >> >>>>> On 23.10.12 at 09:18, Mauro wrote: >> >>>> A kernel run with no Xen underneath it. >> >>> >> >>> Here is: >> >>> >> >>> uname -a >> >>> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 >> >>> x86_64 GNU/Linux >> >> >> >> I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as >> >> I had asked for). >> > >> > Sorry I'm using squeeze in production servers and I don't have a test >> > machine on which install wheezy. >> >> And can't/don't want to install a self-built kernel? > > You could also boot one of those Live Image kernels. Like an Fedora or > Ubuntu and just capture this. That way you don't over-write anything. Ok, I'll try the live image. Today I had another clock jump so cpuidle=0 doesn't work. I'll stay using clocksource= pit and cpuidle =0 for a while to see if they together work. But...how to know what C states the kernel uses? From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 15:43:20 +0100 Message-ID: <5086C92802000078000A3829@nat28.tlf.novell.com> References: <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> <5086766502000078000A35D4@nat28.tlf.novell.com> <20121023115001.GA23471@localhost.localdomain> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: Konrad Rzeszutek Wilk , xen-devel List-Id: xen-devel@lists.xenproject.org >>> On 23.10.12 at 16:07, Mauro wrote: > On 23 October 2012 13:50, Konrad Rzeszutek Wilk wrote: >> On Tue, Oct 23, 2012 at 09:50:13AM +0100, Jan Beulich wrote: >>> >>> On 23.10.12 at 10:40, Mauro wrote: >>> > On 23 October 2012 09:58, Jan Beulich wrote: >>> >>>>> On 23.10.12 at 09:18, Mauro wrote: >>> >>>> A kernel run with no Xen underneath it. >>> >>> >>> >>> Here is: >>> >>> >>> >>> uname -a >>> >>> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 >>> >>> x86_64 GNU/Linux >>> >> >>> >> I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as >>> >> I had asked for). >>> > >>> > Sorry I'm using squeeze in production servers and I don't have a test >>> > machine on which install wheezy. >>> >>> And can't/don't want to install a self-built kernel? >> >> You could also boot one of those Live Image kernels. Like an Fedora or >> Ubuntu and just capture this. That way you don't over-write anything. > > Ok, I'll try the live image. > Today I had another clock jump so cpuidle=0 doesn't work. > I'll stay using clocksource= pit and cpuidle =0 for a while to see if > they together work. > But...how to know what C states the kernel uses? If "cpuidle=0" alone doesn't work, your problem is not C-state related, and you don't need to look up how much of it the native kernel uses. Jan From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 16:46:13 +0200 Message-ID: References: <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> <5086766502000078000A35D4@nat28.tlf.novell.com> <20121023115001.GA23471@localhost.localdomain> <5086C92802000078000A3829@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <5086C92802000078000A3829@nat28.tlf.novell.com> List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Jan Beulich Cc: xen-devel List-Id: xen-devel@lists.xenproject.org On 23 October 2012 16:43, Jan Beulich wrote: >>>> On 23.10.12 at 16:07, Mauro wrote: >> On 23 October 2012 13:50, Konrad Rzeszutek Wilk wrote: >>> On Tue, Oct 23, 2012 at 09:50:13AM +0100, Jan Beulich wrote: >>>> >>> On 23.10.12 at 10:40, Mauro wrote: >>>> > On 23 October 2012 09:58, Jan Beulich wrote: >>>> >>>>> On 23.10.12 at 09:18, Mauro wrote: >>>> >>>> A kernel run with no Xen underneath it. >>>> >>> >>>> >>> Here is: >>>> >>> >>>> >>> uname -a >>>> >>> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 >>>> >>> x86_64 GNU/Linux >>>> >> >>>> >> I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as >>>> >> I had asked for). >>>> > >>>> > Sorry I'm using squeeze in production servers and I don't have a test >>>> > machine on which install wheezy. >>>> >>>> And can't/don't want to install a self-built kernel? >>> >>> You could also boot one of those Live Image kernels. Like an Fedora or >>> Ubuntu and just capture this. That way you don't over-write anything. >> >> Ok, I'll try the live image. >> Today I had another clock jump so cpuidle=0 doesn't work. >> I'll stay using clocksource= pit and cpuidle =0 for a while to see if >> they together work. >> But...how to know what C states the kernel uses? > > If "cpuidle=0" alone doesn't work, your problem is not C-state > related, and you don't need to look up how much of it the native > kernel uses. Ok, cpuidle=0 however is to be used because also clocksource=pit alone doesn't work. Now I'll use both params and see what happens. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Mauro Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 17:34:04 +0200 Message-ID: References: <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> <5086766502000078000A35D4@nat28.tlf.novell.com> <20121023115001.GA23471@localhost.localdomain> <5086C92802000078000A3829@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Jan Beulich Cc: xen-devel List-Id: xen-devel@lists.xenproject.org On 23 October 2012 16:46, Mauro wrote: > On 23 October 2012 16:43, Jan Beulich wrote: >>>>> On 23.10.12 at 16:07, Mauro wrote: >>> On 23 October 2012 13:50, Konrad Rzeszutek Wilk wrote: >>>> On Tue, Oct 23, 2012 at 09:50:13AM +0100, Jan Beulich wrote: >>>>> >>> On 23.10.12 at 10:40, Mauro wrote: >>>>> > On 23 October 2012 09:58, Jan Beulich wrote: >>>>> >>>>> On 23.10.12 at 09:18, Mauro wrote: >>>>> >>>> A kernel run with no Xen underneath it. >>>>> >>> >>>>> >>> Here is: >>>>> >>> >>>>> >>> uname -a >>>>> >>> Linux xen-p02 2.6.32-5-amd64 #1 SMP Sun Sep 23 10:07:46 UTC 2012 >>>>> >>> x86_64 GNU/Linux >>>>> >> >>>>> >> I'm sorry to say that, but 2.6.32 is nowhere close to "recent" (as >>>>> >> I had asked for). >>>>> > >>>>> > Sorry I'm using squeeze in production servers and I don't have a test >>>>> > machine on which install wheezy. >>>>> >>>>> And can't/don't want to install a self-built kernel? >>>> >>>> You could also boot one of those Live Image kernels. Like an Fedora or >>>> Ubuntu and just capture this. That way you don't over-write anything. >>> >>> Ok, I'll try the live image. >>> Today I had another clock jump so cpuidle=0 doesn't work. >>> I'll stay using clocksource= pit and cpuidle =0 for a while to see if >>> they together work. >>> But...how to know what C states the kernel uses? >> >> If "cpuidle=0" alone doesn't work, your problem is not C-state >> related, and you don't need to look up how much of it the native >> kernel uses. > > Ok, cpuidle=0 however is to be used because also clocksource=pit alone > doesn't work. > Now I'll use both params and see what happens. Sorry for the noise, I'm not sure if it has been really a clock jump. Retry using only cpuidle=0 and then what Jan has suggested. If you want to see the C states tell me how to do. p.s. sorry for bad english From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Jan Beulich" Subject: Re: [Xen-users] Re: Xen 4 TSC problems Date: Tue, 23 Oct 2012 16:49:24 +0100 Message-ID: <5086D8A402000078000A38B7@nat28.tlf.novell.com> References: <508509D902000078000A2DDA@nat28.tlf.novell.com> <50852D8C02000078000A2E48@nat28.tlf.novell.com> <5085530202000078000A2F12@nat28.tlf.novell.com> <50866A2C02000078000A356D@nat28.tlf.novell.com> <5086766502000078000A35D4@nat28.tlf.novell.com> <20121023115001.GA23471@localhost.localdomain> <5086C92802000078000A3829@nat28.tlf.novell.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: Content-Disposition: inline List-Unsubscribe: , List-Post: List-Help: List-Subscribe: , Sender: xen-devel-bounces@lists.xen.org Errors-To: xen-devel-bounces@lists.xen.org To: Mauro Cc: xen-devel List-Id: xen-devel@lists.xenproject.org >>> On 23.10.12 at 17:34, Mauro wrote: > On 23 October 2012 16:46, Mauro wrote: >> On 23 October 2012 16:43, Jan Beulich wrote: >>>>>> On 23.10.12 at 16:07, Mauro wrote: >>>> Today I had another clock jump so cpuidle=0 doesn't work. >>>> I'll stay using clocksource= pit and cpuidle =0 for a while to see if >>>> they together work. >>>> But...how to know what C states the kernel uses? >>> >>> If "cpuidle=0" alone doesn't work, your problem is not C-state >>> related, and you don't need to look up how much of it the native >>> kernel uses. >> >> Ok, cpuidle=0 however is to be used because also clocksource=pit alone >> doesn't work. >> Now I'll use both params and see what happens. > > Sorry for the noise, I'm not sure if it has been really a clock jump. > Retry using only cpuidle=0 and then what Jan has suggested. > If you want to see the C states tell me how to do. Let's wait with that until you're settled on whether "cpuidle=0" alone works. Jan