From: Marcelo Tosatti <mtosatti@redhat.com>
To: Andy Lutomirski <luto@amacapital.net>
Cc: Paolo Bonzini <pbonzini@redhat.com>,
kvm list <kvm@vger.kernel.org>,
Luiz Capitulino <lcapitulino@redhat.com>,
Rik van Riel <riel@redhat.com>, Radim Krcmar <rkrcmar@redhat.com>
Subject: Re: [patch 2/2] KVM: x86: add option to advance tscdeadline hrtimer expiration
Date: Thu, 11 Dec 2014 19:29:31 -0200 [thread overview]
Message-ID: <20141211212931.GA24137@amt.cnet> (raw)
In-Reply-To: <20141211212717.GA22999@amt.cnet>
On Thu, Dec 11, 2014 at 07:27:17PM -0200, Marcelo Tosatti wrote:
> On Thu, Dec 11, 2014 at 01:16:52PM -0800, Andy Lutomirski wrote:
> > On Thu, Dec 11, 2014 at 1:10 PM, Paolo Bonzini <pbonzini@redhat.com> wrote:
> > >
> > >
> > > On 11/12/2014 21:48, Andy Lutomirski wrote:
> > >> On 12/10/2014 07:07 PM, Marcelo Tosatti wrote:
> > >>> On Thu, Dec 11, 2014 at 12:37:57AM +0100, Paolo Bonzini wrote:
> > >>>>
> > >>>>
> > >>>> On 10/12/2014 21:57, Marcelo Tosatti wrote:
> > >>>>> For the hrtimer which emulates the tscdeadline timer in the guest,
> > >>>>> add an option to advance expiration, and busy spin on VM-entry waiting
> > >>>>> for the actual expiration time to elapse.
> > >>>>>
> > >>>>> This allows achieving low latencies in cyclictest (or any scenario
> > >>>>> which requires strict timing regarding timer expiration).
> > >>>>>
> > >>>>> Reduces cyclictest avg latency by 50%.
> > >>>>>
> > >>>>> Note: this option requires tuning to find the appropriate value
> > >>>>> for a particular hardware/guest combination. One method is to measure the
> > >>>>> average delay between apic_timer_fn and VM-entry.
> > >>>>> Another method is to start with 1000ns, and increase the value
> > >>>>> in say 500ns increments until avg cyclictest numbers stop decreasing.
> > >>>>
> > >>>> What values are you using in practice for the parameter?
> > >>>
> > >>> 7us.
> > >>
> > >> It takes 7us to get from TSC deadline expiration to the *start* of
> > >> vmresume? That seems rather extreme.
> > >
> > > No, to the end. 7us is 21000 clock cycles, and the vmexit+vmentry alone
> > > costs about 1300.
> > >
> >
> > I suspect that something's massively wrong with context switching,
> > then -- it deserves to be considerably faster than that. The
> > architecturally expensive bits are vmresume, interrupt delivery, and
> > iret, but iret is only ~300 cycles and interrupt delivery should be
> > under 1k cycles.
> >
> > Throw in a few hundred more cycles for whatever wrmsr idiocy is going
> > on somewhere in the process, and we're still nowhere near 21k cycles.
>
>
> <idle>-0 [003] d..h2.. 1991756745496752: apic_timer_fn
> <-__run_hrtimer
> <idle>-0 [003] dN.h2.. 1991756745498732: tick_program_event <-hrtimer_interrupt
> <idle>-0 [003] d...3.. 1991756745502112: sched_switch: prev_comm=swapper/3 prev_pid=0 prev_prio=120 prev_state=R ==> next_comm=qemu-system-x86 next_pid=20114 next_prio=98
> <idle>-0 [003] d...2.. 1991756745502592: __context_tracking_task_switch <-__schedule
> qemu-system-x86-20114 [003] ....1.. 1991756745503916: kvm_arch_vcpu_load <-kvm_sched_in
> qemu-system-x86-20114 [003] ....... 1991756745505320: kvm_cpu_has_pending_timer <-kvm_vcpu_block
> qemu-system-x86-20114 [003] ....... 1991756745506260: kvm_cpu_has_pending_timer <-kvm_arch_vcpu_ioctl_run
> qemu-system-x86-20114 [003] ....... 1991756745507812: kvm_apic_accept_events <-kvm_arch_vcpu_ioctl_run
> qemu-system-x86-20114 [003] ....... 1991756745508100: kvm_cpu_has_pending_timer <-kvm_arch_vcpu_ioctl_run
> qemu-system-x86-20114 [003] ....... 1991756745508872: kvm_apic_accept_events <-vcpu_enter_guest
> qemu-system-x86-20114 [003] ....1.. 1991756745510040: vmx_save_host_state <-vcpu_enter_guest
> qemu-system-x86-20114 [003] d...2.. 1991756745511876: kvm_entry: vcpu 1
>
>
> 1991756745511876 - 1991756745496752 = 15124
>
> The timestamps are TSC reads.
>
> This is patched to run without ksoftirqd. Consider:
>
> The LAPIC is programmed to the next earliest event by hrtimer_interrupt.
> VM-entry is processing KVM_REQ_DEACTIVATE_FPU, KVM_REQ_EVENT.
>
model : 58
model name : Intel(R) Core(TM) i5-3470S CPU @ 2.90GHz
stepping : 9
microcode : 0x1b
cpu MHz : 2873.492
cache size : 6144 KB
next prev parent reply other threads:[~2014-12-11 21:29 UTC|newest]
Thread overview: 21+ messages / expand[flat|nested] mbox.gz Atom feed top
2014-12-10 20:57 [patch 0/2] KVM: add option to advance tscdeadline hrtimer expiration (v3) Marcelo Tosatti
2014-12-10 20:57 ` [patch 1/2] KVM: x86: add method to test PIR bitmap vector Marcelo Tosatti
2014-12-10 20:57 ` [patch 2/2] KVM: x86: add option to advance tscdeadline hrtimer expiration Marcelo Tosatti
2014-12-10 23:37 ` Paolo Bonzini
2014-12-11 3:07 ` Marcelo Tosatti
2014-12-11 18:58 ` Paolo Bonzini
2014-12-11 20:48 ` Andy Lutomirski
2014-12-11 20:58 ` Marcelo Tosatti
2014-12-11 21:07 ` Andy Lutomirski
2014-12-11 21:37 ` Rik van Riel
2014-12-11 21:10 ` Paolo Bonzini
2014-12-11 21:16 ` Andy Lutomirski
2014-12-11 21:27 ` Marcelo Tosatti
2014-12-11 21:29 ` Marcelo Tosatti [this message]
2014-12-12 18:35 ` Radim Krcmar
-- strict thread matches above, loose matches on Subject: below --
2014-12-10 17:06 [patch 0/2] KVM: add option to advance tscdeadline hrtimer expiration (v2) Marcelo Tosatti
2014-12-10 17:06 ` [patch 2/2] KVM: x86: add option to advance tscdeadline hrtimer expiration Marcelo Tosatti
2014-12-10 17:11 ` Rik van Riel
2014-12-10 16:53 [patch 0/2] KVM: " Marcelo.Tosatti
2014-12-10 16:53 ` [patch 2/2] KVM: x86: " Marcelo.Tosatti
2014-12-10 17:08 ` Paolo Bonzini
2014-12-10 17:34 ` Marcelo Tosatti
2014-12-10 17:53 ` Paolo Bonzini
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20141211212931.GA24137@amt.cnet \
--to=mtosatti@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=lcapitulino@redhat.com \
--cc=luto@amacapital.net \
--cc=pbonzini@redhat.com \
--cc=riel@redhat.com \
--cc=rkrcmar@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox