From: Like Xu <like.xu.linux@gmail.com>
To: Jim Mattson <jmattson@google.com>,
Sean Christopherson <sean.j.christopherson@intel.com>
Cc: Paolo Bonzini <pbonzini@redhat.com>,
kvm list <kvm@vger.kernel.org>,
Vitaly Kuznetsov <vkuznets@redhat.com>,
Wanpeng Li <wanpengli@tencent.com>,
Joerg Roedel <joro@8bytes.org>,
wei.huang2@amd.com, Peter Zijlstra <peterz@infradead.org>,
Thomas Gleixner <tglx@linutronix.de>,
LKML <linux-kernel@vger.kernel.org>,
Li RongQing <lirongqing@baidu.com>, Like Xu <likexu@tencent.com>
Subject: Re: [PATCH] KVM: X86: Emulate APERF/MPERF to report actual VCPU frequency
Date: Wed, 22 Dec 2021 14:56:55 +0800 [thread overview]
Message-ID: <9e898dae-c8a9-7b04-6165-ca4d7ddce988@gmail.com> (raw)
In-Reply-To: <CALMp9eTYKQ3LrWKu32mJKPzkWMcN5tGSFmj352TPCSrSp7jGxw@mail.gmail.com>
On 24/6/2020 4:34 am, Jim Mattson wrote:
> On Tue, Jun 23, 2020 at 12:05 PM Sean Christopherson
> <sean.j.christopherson@intel.com> wrote:
>>
>> On Tue, Jun 23, 2020 at 11:39:16AM -0700, Jim Mattson wrote:
>>> On Tue, Jun 23, 2020 at 11:29 AM Sean Christopherson
>>> <sean.j.christopherson@intel.com> wrote:
>>>>
>>>> On Tue, Jun 23, 2020 at 02:35:30PM +0800, Like Xu wrote:
>>>>> The aperf/mperf are used to report current CPU frequency after 7d5905dc14a
>>>>> "x86 / CPU: Always show current CPU frequency in /proc/cpuinfo". But guest
>>>>> kernel always reports a fixed VCPU frequency in the /proc/cpuinfo, which
>>>>> may confuse users especially when turbo is enabled on the host.
>>>>>
>>>>> Emulate guest APERF/MPERF capability based their values on the host.
>>>>>
>>>>> Co-developed-by: Li RongQing <lirongqing@baidu.com>
>>>>> Signed-off-by: Li RongQing <lirongqing@baidu.com>
>>>>> Reviewed-by: Chai Wen <chaiwen@baidu.com>
>>>>> Reviewed-by: Jia Lina <jialina01@baidu.com>
>>>>> Signed-off-by: Like Xu <like.xu@linux.intel.com>
>>>>> ---
>>>>
>>>> ...
>>>>
>>>>> @@ -8312,7 +8376,7 @@ static int vcpu_enter_guest(struct kvm_vcpu *vcpu)
>>>>> dm_request_for_irq_injection(vcpu) &&
>>>>> kvm_cpu_accept_dm_intr(vcpu);
>>>>> fastpath_t exit_fastpath;
>>>>> -
>>>>> + u64 enter_mperf = 0, enter_aperf = 0, exit_mperf = 0, exit_aperf = 0;
>>>>> bool req_immediate_exit = false;
>>>>>
>>>>> if (kvm_request_pending(vcpu)) {
>>>>> @@ -8516,8 +8580,17 @@ static int vcpu_enter_guest(struct kvm_vcpu *vcpu)
>>>>> vcpu->arch.switch_db_regs &= ~KVM_DEBUGREG_RELOAD;
>>>>> }
>>>>>
>>>>> + if (unlikely(vcpu->arch.hwp.hw_coord_fb_cap))
>>>>> + get_host_amperf(&enter_mperf, &enter_aperf);
>>>>> +
>>>>> exit_fastpath = kvm_x86_ops.run(vcpu);
>>>>>
>>>>> + if (unlikely(vcpu->arch.hwp.hw_coord_fb_cap)) {
>>>>> + get_host_amperf(&exit_mperf, &exit_aperf);
>>>>> + vcpu_update_amperf(vcpu, get_amperf_delta(enter_aperf, exit_aperf),
>>>>> + get_amperf_delta(enter_mperf, exit_mperf));
>>>>> + }
>>>>> +
>>>>
>>>> Is there an alternative approach that doesn't require 4 RDMSRs on every VMX
>>>> round trip? That's literally more expensive than VM-Enter + VM-Exit
>>>> combined.
It looks like we have quite a few users who are expecting this feature in
different scenarios.
I will add a fast path for RO usage and a slow path if the guest tries to change
the AMPERF values.
>>>>
>>>> E.g. what about adding KVM_X86_DISABLE_EXITS_APERF_MPERF and exposing the
>>>> MSRs for read when that capability is enabled?
>>>
>>> When would you load the hardware MSRs with the guest/host values?
>>
>> Ugh, I was thinking the MSRs were read-only.
>
> EVen if they were read-only, they should power on to zero, and they
> will most likely not be zero when a guest powers on.
Can we assume that "not zero when the guest is on" will not harm any guests ?
>
>> Doesn't this also interact with TSC scaling?
>
> Yes, it should!
We have too much of a historical burden on TSC emulations.
For practical reasons, what if we only expose the AMPERF cap
if the host/guest has both CONSTANT_TSC and NONSTOP_TSC ?
One more design concern, I wonder if it is *safe* for the guest to
read amperf on pCPU[x] the first time and on pCPU[y] the next time.
Any input ?
Thanks,
Like Xu
prev parent reply other threads:[~2021-12-22 6:57 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-06-23 6:35 [PATCH] KVM: X86: Emulate APERF/MPERF to report actual VCPU frequency Like Xu
2020-06-23 18:29 ` Sean Christopherson
2020-06-23 18:39 ` Jim Mattson
2020-06-23 19:05 ` Sean Christopherson
2020-06-23 20:34 ` Jim Mattson
2021-12-22 6:56 ` Like Xu [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=9e898dae-c8a9-7b04-6165-ca4d7ddce988@gmail.com \
--to=like.xu.linux@gmail.com \
--cc=jmattson@google.com \
--cc=joro@8bytes.org \
--cc=kvm@vger.kernel.org \
--cc=likexu@tencent.com \
--cc=linux-kernel@vger.kernel.org \
--cc=lirongqing@baidu.com \
--cc=pbonzini@redhat.com \
--cc=peterz@infradead.org \
--cc=sean.j.christopherson@intel.com \
--cc=tglx@linutronix.de \
--cc=vkuznets@redhat.com \
--cc=wanpengli@tencent.com \
--cc=wei.huang2@amd.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox