From: Zachary Amsden <zamsden@redhat.com>
To: Avi Kivity <avi@redhat.com>
Cc: mtosatti@redhat.com, glommer@redhat.com, kvm@vger.kernel.org,
linux-kernel@vger.kernel.org, Joerg Roedel <joerg.roedel@amd.com>
Subject: Re: [PATCH 15/17] Fix AMD C1 TSC desynchronization
Date: Mon, 14 Jun 2010 23:21:11 -1000 [thread overview]
Message-ID: <4C174607.4040107@redhat.com> (raw)
In-Reply-To: <4C173E18.6000804@redhat.com>
On 06/14/2010 10:47 PM, Avi Kivity wrote:
> On 06/15/2010 10:34 AM, Zachary Amsden wrote:
>> Some AMD based machines can have TSC drift when in C1 HLT state because
>> despite attempting to scale the TSC increment when dividing down the
>> P-state, the processor may return to full P-state to service cache
>> probes. The TSC of halted CPUs can advance faster than that of running
>> CPUs as a result, causing unpredictable TSC drift.
>>
>> We implement a recommended workaround, which is disabling C1 clock
>> ramping.
>> ---
>> arch/x86/kvm/x86.c | 45
>> +++++++++++++++++++++++++++++++++++++++++++++
>> 1 files changed, 45 insertions(+), 0 deletions(-)
>>
>> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
>> index ef847ee..8e836e9 100644
>> --- a/arch/x86/kvm/x86.c
>> +++ b/arch/x86/kvm/x86.c
>> @@ -56,6 +56,11 @@
>> #include<asm/i387.h>
>> #include<asm/xcr.h>
>>
>> +#ifdef CONFIG_K8_NB
>> +#include<linux/pci.h>
>> +#include<asm/k8.h>
>> +#endif
>> +
>> #define MAX_IO_MSRS 256
>> #define CR0_RESERVED_BITS \
>> (~(unsigned long)(X86_CR0_PE | X86_CR0_MP | X86_CR0_EM |
>> X86_CR0_TS \
>> @@ -4287,10 +4292,43 @@ static struct notifier_block
>> kvmclock_cpu_notifier_block = {
>> .priority = -INT_MAX
>> };
>>
>> +static u8 disabled_c1_ramp = 0;
>> +
>> static void kvm_timer_init(void)
>> {
>> int cpu;
>>
>> + /*
>> + * AMD processors can de-synchronize TSC on halt in C1 state,
>> because
>> + * processors in lower P state will have TSC scaled properly during
>> + * normal operation, but will have TSC scaled improperly while
>> + * servicing cache probes. Because there is no way to determine
>> how
>> + * TSC was adjusted during cache probes, there are two solutions:
>> + * resynchronize after halt, or disable C1-clock ramping.
>> + *
>> + * We implemenent solution 2.
>> + */
>> +#ifdef CONFIG_K8_NB
>> + if (boot_cpu_data.x86_vendor == X86_VENDOR_AMD&&
>> + boot_cpu_data.x86 == 0x0f&&
>> + !boot_cpu_has(X86_FEATURE_CONSTANT_TSC)) {
>> + struct pci_dev *nb;
>> + int i;
>> + cache_k8_northbridges();
>> + for (i = 0; i< num_k8_northbridges; i++) {
>> + u8 byte;
>> + nb = k8_northbridges[i];
>> + pci_read_config_byte(nb, 0x87,&byte);
>> + if (byte& 1) {
>> + printk(KERN_INFO "%s: AMD C1 clock ramping detected,
>> performing workaround\n", __func__);
>> + disabled_c1_ramp = byte;
>> + pci_write_config_byte(nb, 0x87, byte& 0xFC);
>> +
>> + }
>> + }
>> + }
>> +#endif
>> +
>> register_hotcpu_notifier(&kvmclock_cpu_notifier_block);
>> if (!boot_cpu_has(X86_FEATURE_CONSTANT_TSC)) {
>> cpufreq_register_notifier(&kvmclock_cpufreq_notifier_block,
>> @@ -4402,6 +4440,13 @@ void kvm_arch_exit(void)
>> unregister_hotcpu_notifier(&kvmclock_cpu_notifier_block);
>> kvm_x86_ops = NULL;
>> kvm_mmu_module_exit();
>> +#ifdef CONFIG_K8_NB
>> + if (disabled_c1_ramp) {
>> + struct pci_dev **nb;
>> + for (nb = k8_northbridges; *nb; nb++)
>> + pci_write_config_byte(*nb, 0x87, disabled_c1_ramp);
>> + }
>> +#endif
>> }
>
> Such platform hackery should be in the platform code, not in kvm. kvm
> might request to enable it (why not enable it unconditionally? should
> we disable it on hardware_disable()?
I actually have some negative effects to report from this patch - when
under stress, my laptop spontaneously shut down. Thermal problems?
I agree it is complete hackery. I do not recommend this patch for
upstream inclusion unless it is proposed also by someone more familiar
with the hardware.
However, it was required for my testing.
next prev parent reply other threads:[~2010-06-15 9:21 UTC|newest]
Thread overview: 70+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-06-15 7:34 TSC cleanups, fixes, documentation for KVM Zachary Amsden
2010-06-15 7:34 ` [PATCH 01/17] Eliminate duplicated timer code Zachary Amsden
2010-06-16 13:07 ` Glauber Costa
2010-06-15 7:34 ` [PATCH 02/17] Make cpu_tsc_khz updates use local CPU Zachary Amsden
2010-06-15 8:02 ` Avi Kivity
2010-06-15 7:34 ` [PATCH 03/17] Unify vendor TSC logic Zachary Amsden
2010-06-16 8:10 ` Jason Wang
2010-06-16 13:22 ` Glauber Costa
2010-06-17 8:03 ` Jason Wang
2010-06-16 18:42 ` Zachary Amsden
2010-06-17 8:15 ` Jason Wang
2010-06-17 20:30 ` Zachary Amsden
2010-06-15 7:34 ` [PATCH 04/17] Fix deep C-state TSC desynchronization Zachary Amsden
2010-06-15 8:09 ` Avi Kivity
2010-06-15 8:14 ` Zachary Amsden
2010-06-16 8:10 ` Jason Wang
2010-06-16 18:43 ` Zachary Amsden
2010-06-16 13:24 ` Glauber Costa
2010-06-16 19:20 ` Zachary Amsden
2010-06-15 7:34 ` [PATCH 05/17] Keep SMP VMs more in sync on unstable TSC Zachary Amsden
2010-06-15 8:11 ` Avi Kivity
2010-06-16 8:11 ` Jason Wang
2010-06-16 13:32 ` Glauber Costa
2010-06-16 21:15 ` Zachary Amsden
2010-06-15 7:34 ` [PATCH 06/17] Rename KVM_REQ_KVMCLOCK_UPDATE Zachary Amsden
2010-06-15 7:34 ` [PATCH 07/17] Perform hardware_enable in CPU_STARTING callback Zachary Amsden
2010-06-15 7:34 ` [PATCH 08/17] Add clock sync request to hardware enable Zachary Amsden
2010-06-15 8:24 ` Avi Kivity
2010-06-15 7:34 ` [PATCH 09/17] Move scale_delta into common header Zachary Amsden
2010-06-15 7:34 ` [PATCH 10/17] Make KVM clock computation work for other scales Zachary Amsden
2010-06-15 7:34 ` [PATCH 11/17] Fix a possible backwards warp of kvmclock Zachary Amsden
2010-06-15 8:40 ` Avi Kivity
2010-06-15 20:37 ` Zachary Amsden
2010-06-15 23:47 ` Marcelo Tosatti
2010-06-16 0:21 ` Zachary Amsden
2010-06-16 0:39 ` Marcelo Tosatti
2010-06-16 8:11 ` Jason Wang
2010-06-16 13:58 ` Glauber Costa
2010-06-16 14:13 ` Avi Kivity
2010-06-16 14:58 ` Glauber Costa
2010-06-16 22:38 ` Zachary Amsden
2010-06-16 19:36 ` Zachary Amsden
2010-06-15 7:34 ` [PATCH 12/17] Add helper function get_kernel_ns Zachary Amsden
2010-06-15 8:41 ` Avi Kivity
2010-06-15 21:03 ` Zachary Amsden
2010-06-15 21:13 ` john stultz
2010-06-16 8:12 ` Jason Wang
2010-06-16 14:03 ` Glauber Costa
2010-06-15 7:34 ` [PATCH 13/17] Add TSC offset tracking Zachary Amsden
2010-06-15 8:44 ` Avi Kivity
2010-06-15 7:34 ` [PATCH 14/17] Fix SVM VMCB reset Zachary Amsden
2010-06-15 7:34 ` [PATCH 15/17] Fix AMD C1 TSC desynchronization Zachary Amsden
2010-06-15 8:47 ` Avi Kivity
2010-06-15 9:21 ` Zachary Amsden [this message]
2010-06-15 14:46 ` Roedel, Joerg
2010-06-15 7:34 ` [PATCH 16/17] TSC reset compensation Zachary Amsden
2010-06-15 8:51 ` Avi Kivity
2010-06-15 20:32 ` Zachary Amsden
2010-06-16 0:27 ` Marcelo Tosatti
2010-06-16 0:32 ` Zachary Amsden
2010-06-16 13:52 ` Glauber Costa
2010-06-16 22:36 ` Zachary Amsden
2010-06-15 7:34 ` [PATCH 17/17] Add timekeeping documentation Zachary Amsden
2010-06-15 8:51 ` Avi Kivity
2010-06-15 20:27 ` Randy Dunlap
2010-06-16 23:59 ` Zachary Amsden
2010-06-17 8:55 ` Andi Kleen
2010-06-17 21:14 ` Zachary Amsden
2010-06-18 7:49 ` Andi Kleen
2010-06-18 16:33 ` Zachary Amsden
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4C174607.4040107@redhat.com \
--to=zamsden@redhat.com \
--cc=avi@redhat.com \
--cc=glommer@redhat.com \
--cc=joerg.roedel@amd.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=mtosatti@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox