From: Sean Christopherson <sean.j.christopherson@intel.com>
To: Joao Martins <joao.m.martins@oracle.com>
Cc: Wanpeng Li <kernellwp@gmail.com>,
Liran Alon <liran.alon@oracle.com>,
Paolo Bonzini <pbonzini@redhat.com>,
Radim Krcmar <rkrcmar@redhat.com>,
kvm@vger.kernel.org, Boris Ostrovsky <boris.ostrovsky@oracle.com>,
Ankur Arora <ankur.a.arora@oracle.com>
Subject: Re: [PATCH] KVM: VMX: Nop emulation of MSR_IA32_POWER_CTL
Date: Fri, 10 May 2019 10:17:33 -0700 [thread overview]
Message-ID: <20190510171733.GA16852@linux.intel.com> (raw)
In-Reply-To: <d930e87a-fbe3-cf63-b8a0-26e9f012442a@oracle.com>
On Fri, May 10, 2019 at 11:34:41AM +0100, Joao Martins wrote:
> On 5/10/19 10:54 AM, Wanpeng Li wrote:
> > It is weird that we can observe intel_idle driver in the guest
> > executes mwait eax=0x20, and the corresponding pCPU enters C3 on HSW
> > server, however, we can't observe this on SKX/CLX server, it just
> > enters maximal C1.
>
> I assume you refer to the case where you pass the host mwait substates to the
> guests as is, right? Or are you zeroing/filtering out the mwait cpuid leaf EDX
> like my patch (attached in the previous message) suggests?
>
> Interestingly, hints set to 0x20 actually corresponds to C6 on HSW (based on
> intel_idle driver). IIUC From the SDM (see Vol 2B, "MWAIT for Power Management"
> in instruction set reference M-U) the hints register, doesn't necessarily
> guarantee the specified C-state depicted in the hints will be used. The manual
> makes it sound like it is tentative, and implementation-specific condition may
> either ignore it or enter a different one. It appears to be only guaranteed that
> it won't enter a C-{sub,}state deeper than the one depicted.
Yep, section "MWAIT EXTENSIONS FOR ADVANCED POWER MANAGEMENT" is more
explicit on this point:
At CPL=0, system software can specify desired C-state and sub C-state by
using the MWAIT hints register (EAX). Processors will not go to C-state
and sub C-state deeper than what is specified by the hint register.
As for why SKX/CLX only enters C1, AFAICT SKX isn't configured to support
C3, e.g. skx_cstates in drivers/idle/intel_idle.c shows C1, C1E and C6.
A quick search brings up a variety of docs that confirm this. My guess is
that C1E provides better power/performance than C3 for the majority of
server workloads, e.g. C3 doesn't provide enough power savings to justify
its higher latency and TLB flush.
next prev parent reply other threads:[~2019-05-10 17:17 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-04-15 15:45 [PATCH] KVM: VMX: Nop emulation of MSR_IA32_POWER_CTL Liran Alon
2019-04-15 18:17 ` Sean Christopherson
2019-04-16 15:21 ` Liran Alon
2019-04-16 15:23 ` Paolo Bonzini
2019-04-16 15:29 ` Liran Alon
2019-04-16 15:40 ` Liran Alon
2019-04-16 19:16 ` Joao Martins
2019-05-10 9:54 ` Wanpeng Li
2019-05-10 10:34 ` Joao Martins
2019-05-10 17:17 ` Sean Christopherson [this message]
2019-05-13 9:13 ` Wanpeng Li
2019-05-15 14:30 ` Sean Christopherson
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20190510171733.GA16852@linux.intel.com \
--to=sean.j.christopherson@intel.com \
--cc=ankur.a.arora@oracle.com \
--cc=boris.ostrovsky@oracle.com \
--cc=joao.m.martins@oracle.com \
--cc=kernellwp@gmail.com \
--cc=kvm@vger.kernel.org \
--cc=liran.alon@oracle.com \
--cc=pbonzini@redhat.com \
--cc=rkrcmar@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox