From: "Radim Krčmář" <rkrcmar@redhat.com>
To: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Paolo Bonzini <pbonzini@redhat.com>, kvm-devel <kvm@vger.kernel.org>
Subject: Re: [PATCH v2] KVM: x86: fix KVM_SET_CLOCK relative to setting correct clock value
Date: Fri, 12 May 2017 19:37:12 +0200 [thread overview]
Message-ID: <20170512173711.GA13226@potion> (raw)
In-Reply-To: <20170512153101.GA1848@amt.cnet>
2017-05-12 12:31-0300, Marcelo Tosatti:
> On Fri, May 12, 2017 at 04:13:23PM +0200, Radim Krčmář wrote:
> > 2017-05-11 12:39-0300, Marcelo Tosatti:
> > > On Wed, May 10, 2017 at 08:04:31PM +0200, Radim Krčmář wrote:
> > > > 2017-05-03 10:43-0300, Marcelo Tosatti:
> > > > and the important fix for kvm master clock is the move of
> > > > kvm_gen_update_masterclock() before we read the time.
> >
> > > > The rest is just a minor optimization that also ignores time since
> > > > master_kernel_ns() and therefore pins user_ns.clock to a slightly
> > > > earlier time.
> > > >
> > > > But all attention was given to the "minor optimization" -- have I missed
> > > > something about the direct use of ka->master_kernel_ns?
> > >
> > > I haven't attempted to optimize anything. Not sure what you mean.
> >
> > I mean, why doesn't the patch look like this?
> d
> > diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
> > index 464da936c53d..8db1d09e59d7 100644
> > --- a/arch/x86/kvm/x86.c
> > +++ b/arch/x86/kvm/x86.c
> > @@ -4175,9 +4175,10 @@ long kvm_arch_vm_ioctl(struct file *filp,
> > goto out;
> >
> > r = 0;
> > + kvm_gen_update_masterclock(kvm);
> > now_ns = get_kvmclock_ns(kvm);
> > kvm->arch.kvmclock_offset += user_ns.clock - now_ns;
> > - kvm_gen_update_masterclock(kvm);
> > + kvm_make_all_cpus_request(kvm, KVM_REQ_CLOCK_UPDATE);
> > break;
>
> now_ns = ka->master_kernel_ns + kvmclock_offset_prev + (grdtsc() - ka->master_cycle_now)
> kvmclock_offset += user_ns.clock - (ka->master_kernel_ns + kvmclock_offset_prev + (grdtsc() - ka->master_cycle_now)
> kvmclock_offset = kvmclock_offset_prev + user_ns.clock - (ka->master_kernel_ns + kvmclock_offset_prev + (grdtsc() - ka->master_cycle_now)
>
> In case of VM was just initialized before migration, kvmclock_offset_prev is -ktime_get_boot_ns()
>
> kvmclock_offset = -ktime_get_boot_ns() + user_ns.clock - (ka->master_kernel_ns -ktime_get_boot_ns() + grdtsc() - ka->master_cycle_now))
>
> But master_kernel_ns = ktime_get_boot_ns() + delta-between-vm-init-and-KVM_SET_CLOCK (AKA delta)
> (the same one from VM init)
>
> kvmclock_offset = -ktime_get_boot_ns() + user_ns.clock - (ktime_get_boot_ns() + delta + -ktime_get_boot_ns() + grdtsc() - ka->master_cycle_now))
>
> kvmclock_offset = -ktime_get_boot_ns() + user_ns.clock - delta - grdtsc() + ka->master_cycle_now
>
> But we don't want grdtsc() - ka->master_cycle_now in there.
>
> Note: grdtsc() == guest read tsc.
>
> Now with
>
> + kvm->arch.kvmclock_offset = user_ns.clock -
> + ka->master_kernel_ns;
>
> What happens is that guest clock starts counting, via kernel timekeeper,
> at the moment kvm_get_time_and_clockread() runs. If you add grdtsc() -
> ka->master_cycle_now in there, you are mindfully counting clock twice
> (first: kernel timekeeper, second: the TSC between the (grdtsc() -
> ka->master_cycle_now) in question.
>
> + kvm->arch.kvmclock_offset = -ktime_get_boot_ns() +user_ns.clock -delta
>
> Note that (grdtsc() - ka->master_cycle_now) is susceptible to scheduling
> etc.
>
> Makes sense?
Yes. The simpler code starts the kvmclock a bit later, but both are
correct -- anything within KVM_SET_CLOCK runtime is.
If we care about accuracy, then we should let userspace provide a
(kernel timestamp, kvm timestamp) pair, so the value of kvmclock can
really be controlled.
Adding ugly optimizations to work around shortcomings of the API is
going the wrong way ...
next prev parent reply other threads:[~2017-05-12 17:37 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-05-02 21:36 [PATCH] KVM: x86: fix KVM_SET_CLOCK relative to setting correct clock value Marcelo Tosatti
2017-05-03 13:08 ` Paolo Bonzini
2017-05-03 13:40 ` Marcelo Tosatti
2017-05-03 13:55 ` Paolo Bonzini
2017-05-03 18:24 ` Marcelo Tosatti
2017-05-03 13:43 ` [PATCH v2] " Marcelo Tosatti
2017-05-10 18:04 ` Radim Krčmář
2017-05-11 15:39 ` Marcelo Tosatti
2017-05-12 14:13 ` Radim Krčmář
2017-05-12 14:36 ` Paolo Bonzini
2017-05-12 15:31 ` Marcelo Tosatti
2017-05-12 17:37 ` Radim Krčmář [this message]
2017-05-13 3:46 ` Marcelo Tosatti
2017-05-15 16:19 ` Radim Krčmář
2017-05-15 21:06 ` Marcelo Tosatti
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20170512173711.GA13226@potion \
--to=rkrcmar@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=mtosatti@redhat.com \
--cc=pbonzini@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox