From mboxrd@z Thu Jan 1 00:00:00 1970 From: Marcelo Tosatti Subject: Re: [patch 12/18] x86: vdso: pvclock gettime support Date: Mon, 29 Oct 2012 16:42:49 -0200 Message-ID: <20121029184249.GC30422@amt.cnet> References: <20121024131340.742340256@redhat.com> <20121024131621.840274360@redhat.com> <508E99D7.2050402@parallels.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Cc: kvm@vger.kernel.org, johnstul@us.ibm.com, jeremy@goop.org, zamsden@gmail.com, gleb@redhat.com, avi@redhat.com, pbonzini@redhat.com To: Glauber Costa Return-path: Received: from mx1.redhat.com ([209.132.183.28]:36914 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1760482Ab2J2SnP (ORCPT ); Mon, 29 Oct 2012 14:43:15 -0400 Content-Disposition: inline In-Reply-To: <508E99D7.2050402@parallels.com> Sender: kvm-owner@vger.kernel.org List-ID: On Mon, Oct 29, 2012 at 06:59:35PM +0400, Glauber Costa wrote: > On 10/24/2012 05:13 PM, Marcelo Tosatti wrote: > > Improve performance of time system calls when using Linux pvclock, > > by reading time info from fixmap visible copy of pvclock data. > > > > Originally from Jeremy Fitzhardinge. > > > > Signed-off-by: Marcelo Tosatti > > > > Index: vsyscall/arch/x86/vdso/vclock_gettime.c > > =================================================================== > > --- vsyscall.orig/arch/x86/vdso/vclock_gettime.c > > +++ vsyscall/arch/x86/vdso/vclock_gettime.c > > @@ -22,6 +22,7 @@ > > #include > > #include > > #include > > +#include > > > > #define gtod (&VVAR(vsyscall_gtod_data)) > > > > @@ -62,6 +63,69 @@ static notrace cycle_t vread_hpet(void) > > return readl((const void __iomem *)fix_to_virt(VSYSCALL_HPET) + 0xf0); > > } > > > > +#ifdef CONFIG_PARAVIRT_CLOCK_VSYSCALL > > + > > +static notrace const struct pvclock_vsyscall_time_info *get_pvti(int cpu) > > +{ > > + const aligned_pvti_t *pvti_base; > > + int idx = cpu / (PAGE_SIZE/PVTI_SIZE); > > + int offset = cpu % (PAGE_SIZE/PVTI_SIZE); > > + > > + BUG_ON(PVCLOCK_FIXMAP_BEGIN + idx > PVCLOCK_FIXMAP_END); > > + > > + pvti_base = (aligned_pvti_t *)__fix_to_virt(PVCLOCK_FIXMAP_BEGIN+idx); > > + > > + return &pvti_base[offset].info; > > +} > > + > > Unless I am missing something, if gcc decides to not inline get_pvti, > this will break, right? I believe you need to mark that function with > __always_inline. Can't see why. Please enlighten me. > > > +static notrace cycle_t vread_pvclock(int *mode) > > +{ > > + const struct pvclock_vsyscall_time_info *pvti; > > + cycle_t ret; > > + u64 last; > > + u32 version; > > + u32 migrate_count; > > + u8 flags; > > + unsigned cpu, cpu1; > > + > > + > > + /* > > + * When looping to get a consistent (time-info, tsc) pair, we > > + * also need to deal with the possibility we can switch vcpus, > > + * so make sure we always re-fetch time-info for the current vcpu. > > + */ > > + do { > > + cpu = __getcpu() & 0xfff; > > Please wrap this 0xfff into something meaningful. OK. > > + pvti = get_pvti(cpu); > > + > > + migrate_count = pvti->migrate_count; > > + > > + version = __pvclock_read_cycles(&pvti->pvti, &ret, &flags); > > + > > + /* > > + * Test we're still on the cpu as well as the version. > > + * We could have been migrated just after the first > > + * vgetcpu but before fetching the version, so we > > + * wouldn't notice a version change. > > + */ > > + cpu1 = __getcpu() & 0xfff; > > + } while (unlikely(cpu != cpu1 || > > + (pvti->pvti.version & 1) || > > + pvti->pvti.version != version || > > + pvti->migrate_count != migrate_count)); > > + > > + if (unlikely(!(flags & PVCLOCK_TSC_STABLE_BIT))) > > + *mode = VCLOCK_NONE; > > + > > + last = VVAR(vsyscall_gtod_data).clock.cycle_last; > > + > > + if (likely(ret >= last)) > > + return ret; > > + > > Please add a comment here referring to tsc.c, where an explanation of > this test lives. This is quite non-obvious for the non initiated. OK.