* HPET (?) related hangs and breakage in 2.6.35,36
@ 2010-11-09 13:46 Andrew Lutomirski
2010-11-09 15:20 ` Clemens Ladisch
0 siblings, 1 reply; 13+ messages in thread
From: Andrew Lutomirski @ 2010-11-09 13:46 UTC (permalink / raw)
To: linux-kernel
Hi all-
I have a problem, possibly HPET related, that's gotten a lot worse in
2.6.35 and later. It is present in the latest Fedora 14 kernel
(2.6.35-blahblahblah) and in kernel.org's 2.6.36.
After several minutes of uptime, my system freezes in a strange way.
All of userspace and some kernel services freeze hard (no mouse, no
typing, etc.) Caps-lock works intermittently, as does (I think)
pinging. The only debug output I've been able to get is this (typed
from a blurry photo because netconsole dies along with everything
else):
[ 5478.244722] Clocksource tsc unstable (delta = 101399051316 ns)
[ 5478.367958] Switching to clocksource hpet
[ 5543.729698] BUG: soft lockup - CPU #1 stuck for 61s! [migration/1:6]
[ 5543.729743] Modules linked in: [...]
At this point, even SysRq only works some of the time (it'll work for
a few seconds then die for awhile). Banging on the keyboard doesn't
appear to help, and I'd expect it to if all that was wrong was that I
was losing wakeups.
IIRC I got the tsc unstable messages on older kernels with similarly
insanely large deltas but the system kept working.
Booting with hpet=disable not only makes the system stable but
prevents the clocksource tsc unstable message. ntpd doesn't seem to
complain about my clock, either (and on a different machine with a (I
think) particularly crappy onboard clock, ntpd complains loudly, so I
assume that ntpd would tell me if my clock didn't work right).
CPU is Intel Xeon W5320 at 2.67GHz and /proc/cpuinfo says tsc, rdtscp,
constant_tsc, nonstop_tsc, so the TSC should be good.
Motherboard is Asus P6T WS PRO rev 1.xx running a BIOS from 02/26/2009
(version 0603).
My guess is that my hpet (or its driver) is broken enough to bring
down userspace and that the kernel incorrectly trusts the hpet more
than the TSC.
--Andy
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-09 13:46 HPET (?) related hangs and breakage in 2.6.35,36 Andrew Lutomirski
@ 2010-11-09 15:20 ` Clemens Ladisch
2010-11-09 15:38 ` Borislav Petkov
0 siblings, 1 reply; 13+ messages in thread
From: Clemens Ladisch @ 2010-11-09 15:20 UTC (permalink / raw)
To: Andrew Lutomirski; +Cc: linux-kernel
Andrew Lutomirski wrote:
> After several minutes of uptime, my system freezes in a strange way.
> All of userspace and some kernel services freeze hard (no mouse, no
> typing, etc.) Caps-lock works intermittently, as does (I think)
> pinging. The only debug output I've been able to get is this (typed
> from a blurry photo because netconsole dies along with everything
> else):
>
> [ 5478.244722] Clocksource tsc unstable (delta = 101399051316 ns)
> [ 5478.367958] Switching to clocksource hpet
> [ 5543.729698] BUG: soft lockup - CPU #1 stuck for 61s! [migration/1:6]
Looks like this one:
http://lkml.org/lkml/2010/10/26/126
Regards,
Clemens
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-09 15:20 ` Clemens Ladisch
@ 2010-11-09 15:38 ` Borislav Petkov
2010-11-09 17:44 ` Andrew Lutomirski
0 siblings, 1 reply; 13+ messages in thread
From: Borislav Petkov @ 2010-11-09 15:38 UTC (permalink / raw)
To: Clemens Ladisch
Cc: Andrew Lutomirski, linux-kernel, Thomas Gleixner, Peter Zijlstra,
Ingo Molnar, H. Peter Anvin, x86
On Tue, Nov 09, 2010 at 04:20:56PM +0100, Clemens Ladisch wrote:
> Andrew Lutomirski wrote:
> > After several minutes of uptime, my system freezes in a strange way.
> > All of userspace and some kernel services freeze hard (no mouse, no
> > typing, etc.) Caps-lock works intermittently, as does (I think)
> > pinging. The only debug output I've been able to get is this (typed
> > from a blurry photo because netconsole dies along with everything
> > else):
> >
> > [ 5478.244722] Clocksource tsc unstable (delta = 101399051316 ns)
> > [ 5478.367958] Switching to clocksource hpet
> > [ 5543.729698] BUG: soft lockup - CPU #1 stuck for 61s! [migration/1:6]
>
> Looks like this one:
> http://lkml.org/lkml/2010/10/26/126
I don't think so: the one at the URL above is 37-rc1 specific (issue
came in during the merge window) while Andrew's problem can be
reproduced with 35 and 36, reportedly.
Anyway, adding more people to Cc.
--
Regards/Gruss,
Boris.
Advanced Micro Devices GmbH
Einsteinring 24, 85609 Dornach
General Managers: Alberto Bozzo, Andrew Bowd
Registration: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-09 15:38 ` Borislav Petkov
@ 2010-11-09 17:44 ` Andrew Lutomirski
2010-11-09 18:01 ` Borislav Petkov
0 siblings, 1 reply; 13+ messages in thread
From: Andrew Lutomirski @ 2010-11-09 17:44 UTC (permalink / raw)
To: Borislav Petkov, Clemens Ladisch, Andrew Lutomirski, linux-kernel,
Thomas Gleixner, Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86
On Tue, Nov 9, 2010 at 10:38 AM, Borislav Petkov <bp@amd64.org> wrote:
> On Tue, Nov 09, 2010 at 04:20:56PM +0100, Clemens Ladisch wrote:
>> Andrew Lutomirski wrote:
>> > After several minutes of uptime, my system freezes in a strange way.
>> > All of userspace and some kernel services freeze hard (no mouse, no
>> > typing, etc.) Caps-lock works intermittently, as does (I think)
>> > pinging. The only debug output I've been able to get is this (typed
>> > from a blurry photo because netconsole dies along with everything
>> > else):
>> >
>> > [ 5478.244722] Clocksource tsc unstable (delta = 101399051316 ns)
>> > [ 5478.367958] Switching to clocksource hpet
>> > [ 5543.729698] BUG: soft lockup - CPU #1 stuck for 61s! [migration/1:6]
>>
>> Looks like this one:
>> http://lkml.org/lkml/2010/10/26/126
>
> I don't think so: the one at the URL above is 37-rc1 specific (issue
> came in during the merge window) while Andrew's problem can be
> reproduced with 35 and 36, reportedly.
>
> Anyway, adding more people to Cc.
I may have just reproduced after a couple of days of uptime it in
2.6.36 with hpet=disable (in any case, the system hung and netconsole
didn't work).
I'm currently testing the hpet_min_tick stuff, backported to 2.6.36,
with hpet on. Haven't had a problem yet, but I'll see what happens
after a few hours. So far, it said:
ACPI: HPET id: 0x8086a301 base: 0xfed00000 min tick: 128
Is there any way that a tsc delta of >100 seconds occurring awhile
after bootup on Nehalem with a single socket can real (as opposed to a
bug making the kernel think that there was a delta when there really
wasn't)?
--Andy
>
> --
> Regards/Gruss,
> Boris.
>
> Advanced Micro Devices GmbH
> Einsteinring 24, 85609 Dornach
> General Managers: Alberto Bozzo, Andrew Bowd
> Registration: Dornach, Gemeinde Aschheim, Landkreis Muenchen
> Registergericht Muenchen, HRB Nr. 43632
>
>
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-09 17:44 ` Andrew Lutomirski
@ 2010-11-09 18:01 ` Borislav Petkov
2010-11-09 21:29 ` Andrew Lutomirski
0 siblings, 1 reply; 13+ messages in thread
From: Borislav Petkov @ 2010-11-09 18:01 UTC (permalink / raw)
To: Andrew Lutomirski
Cc: Clemens Ladisch, linux-kernel@vger.kernel.org, Thomas Gleixner,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86
On Tue, Nov 09, 2010 at 12:44:48PM -0500, Andrew Lutomirski wrote:
> I may have just reproduced after a couple of days of uptime it in
> 2.6.36 with hpet=disable (in any case, the system hung and netconsole
> didn't work).
>
> I'm currently testing the hpet_min_tick stuff, backported to 2.6.36,
> with hpet on. Haven't had a problem yet, but I'll see what happens
> after a few hours. So far, it said:
>
> ACPI: HPET id: 0x8086a301 base: 0xfed00000 min tick: 128
This doesn't say what your BIOS has set it to originally. For that, do
hexdump -C /sys/firmware/acpi/tables/HPET
and check what does the u16 little endian value at offset 0x35 say.
--
Regards/Gruss,
Boris.
Advanced Micro Devices GmbH
Einsteinring 24, 85609 Dornach
General Managers: Alberto Bozzo, Andrew Bowd
Registration: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-09 18:01 ` Borislav Petkov
@ 2010-11-09 21:29 ` Andrew Lutomirski
2010-11-10 18:48 ` [FALSE ALARM] " Andrew Lutomirski
0 siblings, 1 reply; 13+ messages in thread
From: Andrew Lutomirski @ 2010-11-09 21:29 UTC (permalink / raw)
To: Borislav Petkov
Cc: Clemens Ladisch, linux-kernel@vger.kernel.org, Thomas Gleixner,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86
On Tue, Nov 9, 2010 at 1:01 PM, Borislav Petkov <bp@amd64.org> wrote:
> On Tue, Nov 09, 2010 at 12:44:48PM -0500, Andrew Lutomirski wrote:
>> I may have just reproduced after a couple of days of uptime it in
>> 2.6.36 with hpet=disable (in any case, the system hung and netconsole
>> didn't work).
>>
>> I'm currently testing the hpet_min_tick stuff, backported to 2.6.36,
>> with hpet on. Haven't had a problem yet, but I'll see what happens
>> after a few hours. So far, it said:
>>
>> ACPI: HPET id: 0x8086a301 base: 0xfed00000 min tick: 128
>
> This doesn't say what your BIOS has set it to originally. For that, do
>
> hexdump -C /sys/firmware/acpi/tables/HPET
>
> and check what does the u16 little endian value at offset 0x35 say.
0x37ee, which people have seen before. In any case, even with the
patch applied my system hung, and I still got this off netconsole
sometime around when the hang happened:
Clocksource: tsc unstable (delta = -34355296774 ns)
Switching: to clocksource hpet
Another odd symptom: none of the SysRq hotkeys I could think of did
anything *except for SysRq-b*. I have no idea what could cause that.
My system appears to pass ingo's time-warp-test (when compiled with
-O0 -- it segfaults with -O2), so I think my TSC is okay.
--Andy
^ permalink raw reply [flat|nested] 13+ messages in thread
* [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-09 21:29 ` Andrew Lutomirski
@ 2010-11-10 18:48 ` Andrew Lutomirski
2010-11-10 18:50 ` Borislav Petkov
0 siblings, 1 reply; 13+ messages in thread
From: Andrew Lutomirski @ 2010-11-10 18:48 UTC (permalink / raw)
To: Borislav Petkov
Cc: Clemens Ladisch, linux-kernel@vger.kernel.org, Thomas Gleixner,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86
On Tue, Nov 9, 2010 at 4:29 PM, Andrew Lutomirski <luto@mit.edu> wrote:
> On Tue, Nov 9, 2010 at 1:01 PM, Borislav Petkov <bp@amd64.org> wrote:
>> On Tue, Nov 09, 2010 at 12:44:48PM -0500, Andrew Lutomirski wrote:
>>> I may have just reproduced after a couple of days of uptime it in
>>> 2.6.36 with hpet=disable (in any case, the system hung and netconsole
>>> didn't work).
>>>
>>> I'm currently testing the hpet_min_tick stuff, backported to 2.6.36,
>>> with hpet on. Haven't had a problem yet, but I'll see what happens
>>> after a few hours. So far, it said:
>>>
>>> ACPI: HPET id: 0x8086a301 base: 0xfed00000 min tick: 128
>>
>> This doesn't say what your BIOS has set it to originally. For that, do
>>
>> hexdump -C /sys/firmware/acpi/tables/HPET
>>
>> and check what does the u16 little endian value at offset 0x35 say.
>
> 0x37ee, which people have seen before. In any case, even with the
> patch applied my system hung, and I still got this off netconsole
> sometime around when the hang happened:
>
> Clocksource: tsc unstable (delta = -34355296774 ns)
> Switching: to clocksource hpet
Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
send a bug report to the maintainers.
--Andy
>
> Another odd symptom: none of the SysRq hotkeys I could think of did
> anything *except for SysRq-b*. I have no idea what could cause that.
>
> My system appears to pass ingo's time-warp-test (when compiled with
> -O0 -- it segfaults with -O2), so I think my TSC is okay.
>
> --Andy
>
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-10 18:48 ` [FALSE ALARM] " Andrew Lutomirski
@ 2010-11-10 18:50 ` Borislav Petkov
2010-11-10 19:02 ` Andrew Lutomirski
2010-11-11 11:06 ` Roedel, Joerg
0 siblings, 2 replies; 13+ messages in thread
From: Borislav Petkov @ 2010-11-10 18:50 UTC (permalink / raw)
To: Andrew Lutomirski
Cc: Clemens Ladisch, linux-kernel@vger.kernel.org, Thomas Gleixner,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86,
Jörg Rödel
On Wed, Nov 10, 2010 at 01:48:00PM -0500, Andrew Lutomirski wrote:
> > Clocksource: tsc unstable (delta = -34355296774 ns)
> > Switching: to clocksource hpet
>
> Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
> send a bug report to the maintainers.
Interesting! Joerg was complaining about similar symptoms with .36 today
too.
--
Regards/Gruss,
Boris.
Advanced Micro Devices GmbH
Einsteinring 24, 85609 Dornach
General Managers: Alberto Bozzo, Andrew Bowd
Registration: Dornach, Gemeinde Aschheim, Landkreis Muenchen
Registergericht Muenchen, HRB Nr. 43632
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-10 18:50 ` Borislav Petkov
@ 2010-11-10 19:02 ` Andrew Lutomirski
2010-11-10 20:52 ` Thomas Gleixner
2010-11-11 11:06 ` Roedel, Joerg
1 sibling, 1 reply; 13+ messages in thread
From: Andrew Lutomirski @ 2010-11-10 19:02 UTC (permalink / raw)
To: Borislav Petkov
Cc: Clemens Ladisch, linux-kernel@vger.kernel.org, Thomas Gleixner,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86,
Jörg Rödel
On Wed, Nov 10, 2010 at 1:50 PM, Borislav Petkov <bp@amd64.org> wrote:
> On Wed, Nov 10, 2010 at 01:48:00PM -0500, Andrew Lutomirski wrote:
>> > Clocksource: tsc unstable (delta = -34355296774 ns)
>> > Switching: to clocksource hpet
>>
>> Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
>> send a bug report to the maintainers.
>
> Interesting! Joerg was complaining about similar symptoms with .36 today
> too.
Well, there is a clocksource sort-of-bug that could cause confusion:
when something totally unrelated to clocksources goes out to lunch,
the clocksource watchdog decides that the clocksource is unstable and
complains, steering everyone toward filing the wrong bug.
tglx?
>
> --
> Regards/Gruss,
> Boris.
>
> Advanced Micro Devices GmbH
> Einsteinring 24, 85609 Dornach
> General Managers: Alberto Bozzo, Andrew Bowd
> Registration: Dornach, Gemeinde Aschheim, Landkreis Muenchen
> Registergericht Muenchen, HRB Nr. 43632
>
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-10 19:02 ` Andrew Lutomirski
@ 2010-11-10 20:52 ` Thomas Gleixner
2010-11-10 20:56 ` Andrew Lutomirski
0 siblings, 1 reply; 13+ messages in thread
From: Thomas Gleixner @ 2010-11-10 20:52 UTC (permalink / raw)
To: Andrew Lutomirski
Cc: Borislav Petkov, Clemens Ladisch, linux-kernel@vger.kernel.org,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86,
Jörg Rödel
[-- Attachment #1: Type: TEXT/PLAIN, Size: 1190 bytes --]
On Wed, 10 Nov 2010, Andrew Lutomirski wrote:
> On Wed, Nov 10, 2010 at 1:50 PM, Borislav Petkov <bp@amd64.org> wrote:
> > On Wed, Nov 10, 2010 at 01:48:00PM -0500, Andrew Lutomirski wrote:
> >> > Clocksource: tsc unstable (delta = -34355296774 ns)
> >> > Switching: to clocksource hpet
> >>
> >> Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
> >> send a bug report to the maintainers.
> >
> > Interesting! Joerg was complaining about similar symptoms with .36 today
> > too.
>
> Well, there is a clocksource sort-of-bug that could cause confusion:
> when something totally unrelated to clocksources goes out to lunch,
> the clocksource watchdog decides that the clocksource is unstable and
> complains, steering everyone toward filing the wrong bug.
How should the clocksource watchdog code know that something went to
lunch? The fact that we need to monitor TSC at all is horrible enough,
adding further heuristics to detect extended lunch breaks would be
just a PITA.
Maybe we could print a different warning when we see large negative
deltas, which is the main indicator for the system being stuck for
quite a time while TSC advances happily.
Thanks,
tglx
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-10 20:52 ` Thomas Gleixner
@ 2010-11-10 20:56 ` Andrew Lutomirski
2010-11-10 20:58 ` Thomas Gleixner
0 siblings, 1 reply; 13+ messages in thread
From: Andrew Lutomirski @ 2010-11-10 20:56 UTC (permalink / raw)
To: Thomas Gleixner
Cc: Borislav Petkov, Clemens Ladisch, linux-kernel@vger.kernel.org,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86,
Jörg Rödel
On Wed, Nov 10, 2010 at 3:52 PM, Thomas Gleixner <tglx@linutronix.de> wrote:
> On Wed, 10 Nov 2010, Andrew Lutomirski wrote:
>
>> On Wed, Nov 10, 2010 at 1:50 PM, Borislav Petkov <bp@amd64.org> wrote:
>> > On Wed, Nov 10, 2010 at 01:48:00PM -0500, Andrew Lutomirski wrote:
>> >> > Clocksource: tsc unstable (delta = -34355296774 ns)
>> >> > Switching: to clocksource hpet
>> >>
>> >> Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
>> >> send a bug report to the maintainers.
>> >
>> > Interesting! Joerg was complaining about similar symptoms with .36 today
>> > too.
>>
>> Well, there is a clocksource sort-of-bug that could cause confusion:
>> when something totally unrelated to clocksources goes out to lunch,
>> the clocksource watchdog decides that the clocksource is unstable and
>> complains, steering everyone toward filing the wrong bug.
>
> How should the clocksource watchdog code know that something went to
> lunch? The fact that we need to monitor TSC at all is horrible enough,
> adding further heuristics to detect extended lunch breaks would be
> just a PITA.
Could the clocksource watchdog detect when it gets woken up (i.e. when
the hardware timer fires) instead of when it gets scheduled? It is
internal to the timing code, after all...
Alternatively, maybe the watchdog could just compare the TSC timestamp
to the current value according to some other clock (PIT? whatever
clockevent is in use?) instead of just using the time passed into the
delayed work in the first place.
>
> Maybe we could print a different warning when we see large negative
> deltas, which is the main indicator for the system being stuck for
> quite a time while TSC advances happily.
That would be an easy fix.
--Andy
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-10 20:56 ` Andrew Lutomirski
@ 2010-11-10 20:58 ` Thomas Gleixner
0 siblings, 0 replies; 13+ messages in thread
From: Thomas Gleixner @ 2010-11-10 20:58 UTC (permalink / raw)
To: Andrew Lutomirski
Cc: Borislav Petkov, Clemens Ladisch, linux-kernel@vger.kernel.org,
Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86,
Jörg Rödel
[-- Attachment #1: Type: TEXT/PLAIN, Size: 1782 bytes --]
On Wed, 10 Nov 2010, Andrew Lutomirski wrote:
> On Wed, Nov 10, 2010 at 3:52 PM, Thomas Gleixner <tglx@linutronix.de> wrote:
> > On Wed, 10 Nov 2010, Andrew Lutomirski wrote:
> >
> >> On Wed, Nov 10, 2010 at 1:50 PM, Borislav Petkov <bp@amd64.org> wrote:
> >> > On Wed, Nov 10, 2010 at 01:48:00PM -0500, Andrew Lutomirski wrote:
> >> >> > Clocksource: tsc unstable (delta = -34355296774 ns)
> >> >> > Switching: to clocksource hpet
> >> >>
> >> >> Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
> >> >> send a bug report to the maintainers.
> >> >
> >> > Interesting! Joerg was complaining about similar symptoms with .36 today
> >> > too.
> >>
> >> Well, there is a clocksource sort-of-bug that could cause confusion:
> >> when something totally unrelated to clocksources goes out to lunch,
> >> the clocksource watchdog decides that the clocksource is unstable and
> >> complains, steering everyone toward filing the wrong bug.
> >
> > How should the clocksource watchdog code know that something went to
> > lunch? The fact that we need to monitor TSC at all is horrible enough,
> > adding further heuristics to detect extended lunch breaks would be
> > just a PITA.
>
> Could the clocksource watchdog detect when it gets woken up (i.e. when
> the hardware timer fires) instead of when it gets scheduled? It is
> internal to the timing code, after all...
>
> Alternatively, maybe the watchdog could just compare the TSC timestamp
> to the current value according to some other clock (PIT? whatever
> clockevent is in use?) instead of just using the time passed into the
> delayed work in the first place.
It compares to some other clock. i.e. HPET in your case, but the
extended lunch break was larger than the HPET wrap around time :(
Thanks,
tglx
^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [FALSE ALARM] Re: HPET (?) related hangs and breakage in 2.6.35,36
2010-11-10 18:50 ` Borislav Petkov
2010-11-10 19:02 ` Andrew Lutomirski
@ 2010-11-11 11:06 ` Roedel, Joerg
1 sibling, 0 replies; 13+ messages in thread
From: Roedel, Joerg @ 2010-11-11 11:06 UTC (permalink / raw)
To: Borislav Petkov
Cc: Andrew Lutomirski, Clemens Ladisch, linux-kernel@vger.kernel.org,
Thomas Gleixner, Peter Zijlstra, Ingo Molnar, H. Peter Anvin, x86
On Wed, Nov 10, 2010 at 01:50:31PM -0500, Borislav Petkov wrote:
> On Wed, Nov 10, 2010 at 01:48:00PM -0500, Andrew Lutomirski wrote:
> > > Clocksource: tsc unstable (delta = -34355296774 ns)
> > > Switching: to clocksource hpet
> >
> > Please disregard -- this is a bug in nouveau (or drm) not hpet. I'll
> > send a bug report to the maintainers.
>
> Interesting! Joerg was complaining about similar symptoms with .36 today
> too.
The issue I have seen was on two boxes with 890FX chipset and Phenom II
X6 CPUs on a recent avi/master which was 2.6.36 + some kvm patches.
The problem was that the box became extremly sluggish and slow. I stated
dstat and it reported a lot of missed ticks. After some time the
clocksource tsc became unstable and the system became responsive again.
I have not yet investigated this further but can do any requested tests.
Joerg
--
AMD Operating System Research Center
Advanced Micro Devices GmbH Einsteinring 24 85609 Dornach
General Managers: Alberto Bozzo, Andrew Bowd
Registration: Dornach, Landkr. Muenchen; Registerger. Muenchen, HRB Nr. 43632
^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2010-11-11 11:04 UTC | newest]
Thread overview: 13+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-11-09 13:46 HPET (?) related hangs and breakage in 2.6.35,36 Andrew Lutomirski
2010-11-09 15:20 ` Clemens Ladisch
2010-11-09 15:38 ` Borislav Petkov
2010-11-09 17:44 ` Andrew Lutomirski
2010-11-09 18:01 ` Borislav Petkov
2010-11-09 21:29 ` Andrew Lutomirski
2010-11-10 18:48 ` [FALSE ALARM] " Andrew Lutomirski
2010-11-10 18:50 ` Borislav Petkov
2010-11-10 19:02 ` Andrew Lutomirski
2010-11-10 20:52 ` Thomas Gleixner
2010-11-10 20:56 ` Andrew Lutomirski
2010-11-10 20:58 ` Thomas Gleixner
2010-11-11 11:06 ` Roedel, Joerg
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.