[PATCH] fix kvmclock bug

public inbox for kvm@vger.kernel.org
 help / color / mirror / Atom feed

* [PATCH] fix kvmclock bug
@ 2010-09-19  0:15 Zachary Amsden
  2010-09-24  7:28 ` Jan Kiszka
  2010-09-29  8:58 ` Avi Kivity
  0 siblings, 2 replies; 6+ messages in thread
From: Zachary Amsden @ 2010-09-19  0:15 UTC (permalink / raw)
  To: Marcelo Tosatti, Avi Kivity, Jan Kiszka, kvm, Glauber Costa

[-- Attachment #1: Type: text/plain, Size: 661 bytes --]

For CPUs with unstable TSC, we null time offset between not just VCPU 
switches, but all preemptions of the kvm thread.  This makes a bug much 
more likely where the kvmclock values are updated before a successful 
exit from virt, causing an underflow.

The null offsetting was added at : bf0fb4a42ba7eb362f4013bd2e93209666793e66
The underflow happens with this additional patch :  
cf839f5da2b0779b9ec8b990f851fb4e7d681da0

There is a secondary bug, which is that TSC fails to advance with real 
time on unstable TSC, but the fix is much more involved (it requires the 
TSC catchup code).

For now, this patch is sufficient to get things working again for me.

[-- Attachment #2: kvmclock-preempt.patch --]
[-- Type: text/plain, Size: 1078 bytes --]

commit 1abe7e8806fd71ea802c6622ed3ce7821a18f271
Author: Zachary Amsden <zamsden@redhat.com>
Date:   Sat Sep 18 13:58:37 2010 -1000

    Fix kvmclock bug
    
    If preempted after kvmclock values are updated, but before hardware
    virtualization is entered, the last tsc time as read by the guest is
    never set.  It underflows the next time kvmclock is updated if there
    has not yet been a successful entry / exit into hardware virt.
    
    Fix this by simply setting last_tsc to the newly read tsc value so
    that any computed nsec advance of kvmclock is nulled.
    
    Signed-off-by: Zachary Amsden <zamsden@redhat.com>

diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
index 76db85a..09f468a 100644
--- a/arch/x86/kvm/x86.c
+++ b/arch/x86/kvm/x86.c
@@ -1101,6 +1101,7 @@ static int kvm_guest_time_update(struct kvm_vcpu *v)
 	vcpu->hv_clock.tsc_timestamp = tsc_timestamp;
 	vcpu->hv_clock.system_time = kernel_ns + v->kvm->arch.kvmclock_offset;
 	vcpu->last_kernel_ns = kernel_ns;
+	vcpu->last_guest_tsc = tsc_timestamp;
 	vcpu->hv_clock.flags = 0;
 
 	/*

^ permalink raw reply related	[flat|nested] 6+ messages in thread

* Re: [PATCH] fix kvmclock bug
  2010-09-19  0:15 [PATCH] fix kvmclock bug Zachary Amsden
@ 2010-09-24  7:28 ` Jan Kiszka
  2010-09-26  9:54   ` Jan Kiszka
  2010-09-29  8:58 ` Avi Kivity
  1 sibling, 1 reply; 6+ messages in thread
From: Jan Kiszka @ 2010-09-24  7:28 UTC (permalink / raw)
  To: Zachary Amsden; +Cc: Marcelo Tosatti, Avi Kivity, kvm, Glauber Costa

[-- Attachment #1: Type: text/plain, Size: 922 bytes --]

Am 19.09.2010 02:15, Zachary Amsden wrote:
> For CPUs with unstable TSC, we null time offset between not just VCPU
> switches, but all preemptions of the kvm thread.  This makes a bug much
> more likely where the kvmclock values are updated before a successful
> exit from virt, causing an underflow.
> 
> The null offsetting was added at : bf0fb4a42ba7eb362f4013bd2e93209666793e66
> The underflow happens with this additional patch : 
> cf839f5da2b0779b9ec8b990f851fb4e7d681da0
> 
> There is a secondary bug, which is that TSC fails to advance with real
> time on unstable TSC, but the fix is much more involved (it requires the
> TSC catchup code).
> 
> For now, this patch is sufficient to get things working again for me.

...but not for me. I still face stuck (or infinitely slow) guests that
want to use kvmclock once tsc_unstable gets set. Or is this patch
addressing a different issue?

Jan


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 259 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] fix kvmclock bug
  2010-09-24  7:28 ` Jan Kiszka
@ 2010-09-26  9:54   ` Jan Kiszka
  2010-09-27 19:00     ` Zachary Amsden
  0 siblings, 1 reply; 6+ messages in thread
From: Jan Kiszka @ 2010-09-26  9:54 UTC (permalink / raw)
  To: Zachary Amsden; +Cc: Marcelo Tosatti, Avi Kivity, kvm, Glauber Costa

[-- Attachment #1: Type: text/plain, Size: 1399 bytes --]

Am 24.09.2010 09:28, Jan Kiszka wrote:
> Am 19.09.2010 02:15, Zachary Amsden wrote:
>> For CPUs with unstable TSC, we null time offset between not just VCPU
>> switches, but all preemptions of the kvm thread.  This makes a bug much
>> more likely where the kvmclock values are updated before a successful
>> exit from virt, causing an underflow.
>>
>> The null offsetting was added at : bf0fb4a42ba7eb362f4013bd2e93209666793e66
>> The underflow happens with this additional patch : 
>> cf839f5da2b0779b9ec8b990f851fb4e7d681da0
>>
>> There is a secondary bug, which is that TSC fails to advance with real
>> time on unstable TSC, but the fix is much more involved (it requires the
>> TSC catchup code).
>>
>> For now, this patch is sufficient to get things working again for me.
> 
> ...but not for me. I still face stuck (or infinitely slow) guests that
> want to use kvmclock once tsc_unstable gets set. Or is this patch
> addressing a different issue?

Commit bfb3f332 ("TSC catchup mode") in kvm.git finally resolves the
issue here.

That only leaves us with the likely wrong unstable declaration of the
TSC after resume. And that raises the question for me if KVM is actually
that much smarter than the Linux kernel in detecting TSC jumps. If
something is missing, can't we improve the kernel's detection mechanism
which already has suspend/resume support?

Jan


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 259 bytes --]

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] fix kvmclock bug
  2010-09-26  9:54   ` Jan Kiszka
@ 2010-09-27 19:00     ` Zachary Amsden
  2010-09-28  8:58       ` Jan Kiszka
  0 siblings, 1 reply; 6+ messages in thread
From: Zachary Amsden @ 2010-09-27 19:00 UTC (permalink / raw)
  To: Jan Kiszka; +Cc: Marcelo Tosatti, Avi Kivity, kvm, Glauber Costa

On 09/25/2010 11:54 PM, Jan Kiszka wrote:
> Am 24.09.2010 09:28, Jan Kiszka wrote:
>    
>> Am 19.09.2010 02:15, Zachary Amsden wrote:
>>      
>>> For CPUs with unstable TSC, we null time offset between not just VCPU
>>> switches, but all preemptions of the kvm thread.  This makes a bug much
>>> more likely where the kvmclock values are updated before a successful
>>> exit from virt, causing an underflow.
>>>
>>> The null offsetting was added at : bf0fb4a42ba7eb362f4013bd2e93209666793e66
>>> The underflow happens with this additional patch :
>>> cf839f5da2b0779b9ec8b990f851fb4e7d681da0
>>>
>>> There is a secondary bug, which is that TSC fails to advance with real
>>> time on unstable TSC, but the fix is much more involved (it requires the
>>> TSC catchup code).
>>>
>>> For now, this patch is sufficient to get things working again for me.
>>>        
>> ...but not for me. I still face stuck (or infinitely slow) guests that
>> want to use kvmclock once tsc_unstable gets set. Or is this patch
>> addressing a different issue?
>>      
> Commit bfb3f332 ("TSC catchup mode") in kvm.git finally resolves the
> issue here.
>    

Good to hear that!

> That only leaves us with the likely wrong unstable declaration of the
> TSC after resume. And that raises the question for me if KVM is actually
> that much smarter than the Linux kernel in detecting TSC jumps. If
> something is missing, can't we improve the kernel's detection mechanism
> which already has suspend/resume support?
>    

Linux must make the the conservative choice about TSC being declared 
unstable; if it is possible that it has become unstable, it is 
unstable.  Unfortunately, this bodes not well for us, as most of the 
finer points of accuracy depend on having a stable TSC.

There's a bunch of places that declare TSC unstable, and where in the 
suspend / resume cycle that happens would depend on your actual hardware.

Zach

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] fix kvmclock bug
  2010-09-27 19:00     ` Zachary Amsden
@ 2010-09-28  8:58       ` Jan Kiszka
  0 siblings, 0 replies; 6+ messages in thread
From: Jan Kiszka @ 2010-09-28  8:58 UTC (permalink / raw)
  To: Zachary Amsden; +Cc: Marcelo Tosatti, Avi Kivity, kvm, Glauber Costa

Am 27.09.2010 21:00, Zachary Amsden wrote:
> On 09/25/2010 11:54 PM, Jan Kiszka wrote:
>> That only leaves us with the likely wrong unstable declaration of the
>> TSC after resume. And that raises the question for me if KVM is actually
>> that much smarter than the Linux kernel in detecting TSC jumps. If
>> something is missing, can't we improve the kernel's detection mechanism
>> which already has suspend/resume support?
>>    
> 
> Linux must make the the conservative choice about TSC being declared
> unstable; if it is possible that it has become unstable, it is
> unstable.  Unfortunately, this bodes not well for us, as most of the
> finer points of accuracy depend on having a stable TSC.
> 
> There's a bunch of places that declare TSC unstable, and where in the
> suspend / resume cycle that happens would depend on your actual hardware.

It's absolutely clear where this happens: kvm_arch_vcpu_load. And it
seems to happen as the TSC is reset due to suspend-to-RAM.

Again: Linux recovers from this and continues to use the TSC. KVM is
more picky, so my question is if this is really required.

Jan

-- 
Siemens AG, Corporate Technology, CT T DE IT 1
Corporate Competence Center Embedded Linux

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH] fix kvmclock bug
  2010-09-19  0:15 [PATCH] fix kvmclock bug Zachary Amsden
  2010-09-24  7:28 ` Jan Kiszka
@ 2010-09-29  8:58 ` Avi Kivity
  1 sibling, 0 replies; 6+ messages in thread
From: Avi Kivity @ 2010-09-29  8:58 UTC (permalink / raw)
  To: Zachary Amsden; +Cc: Marcelo Tosatti, Jan Kiszka, kvm, Glauber Costa

  On 09/19/2010 02:15 AM, Zachary Amsden wrote:
> commit 1abe7e8806fd71ea802c6622ed3ce7821a18f271
> Author: Zachary Amsden<zamsden@redhat.com>
> Date:   Sat Sep 18 13:58:37 2010 -1000
>
>      Fix kvmclock bug

I think there's some redundancy here.  Anyone who has tracked your 
kernel work knows you've done a lot of work on kvmclock, so "Fix bug" 
would be just as descriptive.  Of course, if we fix something, it's 
because it was a bug, so "Fix" is all that's really necessary ("bug" 
would be reserved for commits that introduce bugs).  In the case of 
kvmclock, it's hard to see how new features could be added, and it has a 
pretty bad history of bugs, so most readers would probably deduce that a 
commit fixes a bug.  I don't really see why you wrote a subject line at all.

-- 
I have a truly marvellous patch that fixes the bug which this
signature is too narrow to contain.

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2010-09-29  8:58 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-09-19  0:15 [PATCH] fix kvmclock bug Zachary Amsden
2010-09-24  7:28 ` Jan Kiszka
2010-09-26  9:54   ` Jan Kiszka
2010-09-27 19:00     ` Zachary Amsden
2010-09-28  8:58       ` Jan Kiszka
2010-09-29  8:58 ` Avi Kivity

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox