From: Zachary Amsden <zamsden@redhat.com>
To: Takuya Yoshikawa <yoshikawa.takuya@oss.ntt.co.jp>
Cc: KVM <kvm@vger.kernel.org>, Avi Kivity <avi@redhat.com>,
Marcelo Tosatti <mtosatti@redhat.com>,
Glauber Costa <glommer@redhat.com>,
Linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: [PATCH 18/18] Add timekeeping documentation
Date: Wed, 14 Jul 2010 10:28:01 -1000 [thread overview]
Message-ID: <4C3E1DD1.4070400@redhat.com> (raw)
In-Reply-To: <4C3D643D.6070705@oss.ntt.co.jp>
On 07/13/2010 09:16 PM, Takuya Yoshikawa wrote:
> Hi,
>
> (2010/07/13 11:25), Zachary Amsden wrote:
>> +
>> +2.3) APIC
>> +
>> +On Pentium and later processors, an on-board timer is available to
>> each CPU
>> +as part of the Advanced Programmable Interrupt Controller. The APIC is
>> +accessed through memory-mapped registers and provides interrupt
>> service to each
>> +CPU, used for IPIs and local timer interrupts.
>> +
>> +Although in theory the APIC is a safe and stable source for local
>> interrupts,
>> +in practice, many bugs and glitches have occurred due to the special
>> nature of
>> +the APIC CPU-local memory-mapped hardware. Beware that CPU errata
>> may affect
>> +the use of the APIC and that workarounds may be required. In
>> addition, some of
>> +these workarounds pose unique constraints for virtualization -
>> requiring either
>> +extra overhead incurred from extra reads of memory-mapped I/O or
>> additional
>> +functionality that may be more computationally expensive to implement.
>> +
>> +Since the APIC is documented quite well in the Intel and AMD
>> manuals, we will
>> +avoid repititon of the detail here. It should be pointed out that
>> the APIC
>
> repetition?
>
>> +timer is programmed through the LVT (local vector timer) register,
>> is capable
>> +of one-shot or periodic operation, and is based on the bus clock
>> divided down
>> +by the programmable divider register.
>> +
>> +2.4) HPET
>> +
>> +HPET is quite complex, and was originally intended to replace the
>> PIT / RTC
>> +support of the X86 PC. It remains to be seen whether that will be
>> the case, as
>> +the de facto standard of PC hardware is to emulate these older
>> devices. Some
>> +systems designated as legacy free may support only the HPET as a
>> hardware timer
>> +device.
>> +
>> +The HPET spec is rather loose and vague, requiring at least 3
>> hardware timers,
>> +but allowing implementation freedom to support many more. It also
>> imposes no
>> +fixed rate on the timer frequency, but does impose some extremal
>> values on
>> +frequency, error and slew.
>> +
>> +In general, the HPET is recommended as a high precision (compared to
>> PIT /RTC)
>> +time source which is independent of local variation (as there is
>> only one HPET
>> +in any given system). The HPET is also memory-mapped, and its
>> presence is
>> +indicated through ACPI tables by the BIOS.
>> +
>> +Detailed specification of the HPET is beyond the current scope of this
>> +document, as it is also very well documented elsewhere.
>> +
>
>
>> +3.6) TSC and STPCLK / T-states
>> +
>> +External signals given to the processor may also have the affect of
>> stopping
>
> effect?
>
>> +the TSC. This is typically done for thermal emergency power control
>> to prevent
>> +an overheating condition, and typically, there is no way to detect
>> that this
>> +condition has happened.
>> +
>
>
>> +4.4) Migration
>> +
>> +Migration of a virtual machine raises problems for timekeeping in
>> two ways.
>> +First, the migration itself may take time, during which interrupts
>> cannot be
>> +delivered, and after which, the guest time may need to be caught
>> up. NTP may
>> +be able to help to some degree here, as the clock correction
>> required is
>> +typically small enough to fall in the NTP-correctable window.
>> +
>> +An additional concern is that timers based off the TSC (or HPET, if
>> the raw bus
>> +clock is exposed) may now be running at different rates, requiring
>> compensation
>> +in some may in the hypervisor by virtualizing these timers. In
>> addition,
>
> way?
>
>> +migrating to a faster machine may preclude the use of a passthrough
>> TSC, as a
>> +faster clock cannot be made visible to a guest without the potential
>> of time
>> +advancing faster than usual. A slower clock is less of a problem,
>> as it can
>> +always be caught up to the original rate. KVM clock avoids these
>> problems by
>> +simply storing multipliers and offsets gainst the TSC for the guest
>> to convert
>
> against?
>
>> +back into nanosecond resolution values.
>> +
>
>
> Takuya
>
> -- I'm not English speaker, so not so sure about some places.
Thanks, you found some mistakes anyway, will fix.
Cheers,
Zach
next prev parent reply other threads:[~2010-07-14 20:28 UTC|newest]
Thread overview: 62+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-07-13 2:25 KVM timekeeping fixes, V2 Zachary Amsden
2010-07-13 2:25 ` [PATCH 01/18] Make TSC offset writes non-preemptible Zachary Amsden
2010-07-13 21:33 ` Rik van Riel
2010-07-18 14:28 ` Avi Kivity
2010-07-18 14:30 ` Avi Kivity
2010-07-13 2:25 ` [PATCH 02/18] Fix SVM VMCB reset Zachary Amsden
2010-07-13 21:37 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 03/18] TSC reset compensation Zachary Amsden
2010-07-13 22:11 ` Rik van Riel
2010-07-18 14:34 ` Avi Kivity
2010-07-19 20:01 ` Zachary Amsden
2010-07-13 2:25 ` [PATCH 04/18] Make cpu_tsc_khz updates use local CPU Zachary Amsden
2010-07-14 14:41 ` Rik van Riel
2010-07-18 14:45 ` Avi Kivity
2010-07-19 20:06 ` Zachary Amsden
2010-07-20 8:53 ` Avi Kivity
2010-07-20 21:57 ` Zachary Amsden
2010-07-13 2:25 ` [PATCH 05/18] Warn about unstable TSC Zachary Amsden
2010-07-14 15:02 ` Rik van Riel
2010-07-18 14:47 ` Avi Kivity
2010-07-13 2:25 ` [PATCH 06/18] Unify TSC logic Zachary Amsden
2010-07-14 15:53 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 07/18] Fix deep C-state TSC desynchronization Zachary Amsden
2010-07-14 16:14 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 08/18] Add helper functions for time computation Zachary Amsden
2010-07-14 19:02 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 09/18] Robust TSC compensation Zachary Amsden
2010-07-13 20:34 ` Marcelo Tosatti
2010-07-13 21:15 ` Zachary Amsden
2010-07-13 21:42 ` David S. Ahern
2010-07-13 21:45 ` Zachary Amsden
2010-07-13 23:32 ` Zachary Amsden
2010-07-14 22:33 ` Rik van Riel
2010-07-18 14:52 ` Avi Kivity
2010-07-19 20:39 ` Zachary Amsden
2010-07-13 2:25 ` [PATCH 10/18] Keep SMP VMs more in sync on unstable TSC Zachary Amsden
2010-07-15 2:13 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 11/18] Perform hardware_enable in CPU_STARTING callback Zachary Amsden
2010-07-15 2:15 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 12/18] Add clock sync request to hardware enable Zachary Amsden
2010-07-15 2:32 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 13/18] Move scale_delta into common header Zachary Amsden
2010-07-15 2:35 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 14/18] Fix a possible backwards warp of kvmclock Zachary Amsden
2010-07-15 2:37 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 15/18] Implement getnsboottime kernel API Zachary Amsden
2010-07-15 2:41 ` Rik van Riel
2010-07-18 15:07 ` Avi Kivity
2010-07-13 2:25 ` [PATCH 16/18] Use getnsboottime in KVM Zachary Amsden
2010-07-15 2:43 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 17/18] Indicate reliable TSC in kvmclock Zachary Amsden
2010-07-15 2:44 ` Rik van Riel
2010-07-13 2:25 ` [PATCH 18/18] Add timekeeping documentation Zachary Amsden
2010-07-14 7:16 ` Takuya Yoshikawa
2010-07-14 20:28 ` Zachary Amsden [this message]
2010-07-16 13:19 ` KVM timekeeping fixes, V2 Joerg Roedel
2010-07-16 17:20 ` Zachary Amsden
2010-07-16 19:26 ` Joerg Roedel
2010-07-18 14:22 ` Avi Kivity
2010-07-18 15:08 ` Avi Kivity
2010-07-19 8:11 ` Zachary Amsden
-- strict thread matches above, loose matches on Subject: below --
2010-07-13 2:07 KVM timekeeping fixes Zachary Amsden
2010-07-13 2:08 ` [PATCH 18/18] Add timekeeping documentation Zachary Amsden
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4C3E1DD1.4070400@redhat.com \
--to=zamsden@redhat.com \
--cc=avi@redhat.com \
--cc=glommer@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mtosatti@redhat.com \
--cc=yoshikawa.takuya@oss.ntt.co.jp \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox