kvm.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Gleb Natapov <gleb@redhat.com>
To: Marcelo Tosatti <mtosatti@redhat.com>
Cc: seabios@seabios.org, Avi Kivity <avi@redhat.com>,
	kvm@vger.kernel.org, Gerd Hoffmann <kraxel@redhat.com>
Subject: Re: [PATCH] tsc: use kvmclock for calibration
Date: Fri, 10 Aug 2012 10:18:00 +0300	[thread overview]
Message-ID: <20120810071800.GA24410@redhat.com> (raw)
In-Reply-To: <20120809190913.GG20889@amt.cnet>

On Thu, Aug 09, 2012 at 04:09:13PM -0300, Marcelo Tosatti wrote:
> On Thu, Aug 09, 2012 at 05:01:34PM +0300, Avi Kivity wrote:
> > On 08/09/2012 04:57 PM, Gerd Hoffmann wrote:
> > >   Hi,
> > > 
> > >>> +u64 kvm_tsc_khz(void)
> > >>> +{
> > >>> +    u32 eax, ebx, ecx, edx, msr;
> > >>> +    struct pvclock_vcpu_time_info time;
> > >>> +    u32 addr = (u32)(&time);
> > >>> +    u64 khz;
> > >>> +
> > >>> +    /* check presence and figure msr number */
> > >>> +    cpuid(KVM_CPUID_FEATURES, &eax, &ebx, &ecx, &edx);
> > >>> +    if (eax & KVM_FEATURE_CLOCKSOURCE2) {
> > >>> +        msr = MSR_KVM_SYSTEM_TIME_NEW;
> > >>> +    } else if (eax & KVM_FEATURE_CLOCKSOURCE) {
> > >>> +        msr = MSR_KVM_SYSTEM_TIME;
> > >>> +    } else {
> > >>> +        return 0;
> > >>> +    }
> > >>> +
> > >>> +    /* ask kvm hypervisor to fill struct */
> > >>> +    memset(&time, 0, sizeof(time));
> > >>> +    wrmsr(msr, addr | 1);
> > >> 
> > >> How can this work?
> > > 
> > > It did in my testing, although maybe by pure luck ...
> > > 
> > >> There is a 64-byte alignment requirement.
> > > 
> > > 64 bytes?  Sure?  The whole struct is only 32 bytes in size ...
> > 
> > er, the documentation says 4 bytes (so stack alignment works).  I
> > distinctly remember having a large alignment requirement so we don't
> > cross a page or slot boundary... something's wrong here.
> > 
> > > 
> > > Easily fixable though, just need to grab some memory with memalign
> > > instead of using the stack.
> > 
> > > 
> > >>> +    wrmsr(msr, 0);
> > >>> +    if (time.version < 2 || time.tsc_to_system_mul == 0)
> > >>> +        return 0;
> > >>> +
> > >>> +    /* go figure tsc frequency */
> > >>> +    khz = pvclock_tsc_khz(&time);
> > >>> +    dprintf(1, "Using kvmclock, msr 0x%x, tsc %d MHz\n",
> > >>> +            msr, (u32)khz / 1000);
> > >>> +    return khz;
> > >> 
> > >> That's a meaningless number.  You can be migrated to a cpu or a machine
> > >> with very different tsc.
> > > 
> > >> You want accurate time on kvm, don't use the tsc.
> > > 
> > > seabios uses the tsc for timeout calculations only, so it doesn't need
> > > to be 100% accurate.  The order of magnitude should be correct though.
> > > The Linux kernel uses the value for delay loops too, so using it for the
> > > given purpose can't be *that* horrible after all ...
> > > 
> > > It is certainly an improvement over the current code which tries to
> > > calibrate the tsc and gets totally broken results in case the busy host
> > > happens to schedule the guest in the middle of calibration.
> > > 
> > > So what do you suggest?  The options I see are:
> > > 
> > >   (1) Use this patch (with alignment issue fixed of course).
> > >   (2) Do a full kvmclock implementation.  Feels a bit like overkill.
> > >   (3) SeaBIOS can fallback to the PIT for timing on machines which
> > >       have no TSC.  We could do that too in case we detect kvm ...
> > 
> > What sort of timeouts are these?  If seconds, maybe the rtc would be best.
> 
> I vote for 3 so nobody has to maintain kvmclock code in SeaBIOS and Gerd
That or pm timer.

> can fix the in-kernel PIT issues with GRUB (see Michaels message) while testing.
> 
What message exactly?

--
			Gleb.

  reply	other threads:[~2012-08-10  7:18 UTC|newest]

Thread overview: 23+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-08-09 11:57 [PATCH] tsc: use kvmclock for calibration Gerd Hoffmann
2012-08-09 12:53 ` Avi Kivity
2012-08-09 13:25   ` [SeaBIOS] " Fred .
2012-08-09 13:57   ` Gerd Hoffmann
2012-08-09 14:01     ` Avi Kivity
2012-08-09 14:05       ` Avi Kivity
2012-08-09 14:12         ` Gerd Hoffmann
2012-08-09 14:17           ` Avi Kivity
2012-08-09 14:18       ` Gerd Hoffmann
2012-08-09 14:20         ` Avi Kivity
2012-08-09 19:02           ` Marcelo Tosatti
2012-08-12 10:56             ` Avi Kivity
2012-08-09 19:09       ` Marcelo Tosatti
2012-08-10  7:18         ` Gleb Natapov [this message]
2012-08-10  7:30           ` Gleb Natapov
2012-08-10  8:10         ` Gerd Hoffmann
2012-08-10 21:26           ` Marcelo Tosatti
2012-08-13 10:37             ` Gerd Hoffmann
2012-08-13 10:46               ` Gleb Natapov
2012-08-13 12:55                 ` [SeaBIOS] " Fred .
2012-08-12  9:00           ` Avi Kivity
2012-08-09 18:59   ` Marcelo Tosatti
2012-08-12  9:01     ` Avi Kivity

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20120810071800.GA24410@redhat.com \
    --to=gleb@redhat.com \
    --cc=avi@redhat.com \
    --cc=kraxel@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=mtosatti@redhat.com \
    --cc=seabios@seabios.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).