From: "Doug Smythies" <dsmythies@telus.net>
To: "'Chen, Yu C'" <yu.c.chen@intel.com>
Cc: "'Wysocki, Rafael J'" <rafael.j.wysocki@intel.com>,
tglx@linutronix.de, hpa@zytor.com, bp@alien8.de, "'Zhang,
Rui'" <rui.zhang@intel.com>,
linux-pm@vger.kernel.org, x86@kernel.org,
linux-kernel@vger.kernel.org, "'Brown,
Len'" <len.brown@intel.com>, 'Ingo Molnar' <mingo@kernel.org>,
'Pavel Machek' <pavel@ucw.cz>,
'Kristen Carlson Accardi' <kristen@linux.intel.com>,
"'Pandruvada, Srinivas'" <srinivas.pandruvada@intel.com>
Subject: RE: [PATCH] [v4] x86, suspend: Save/restore extra MSR registers for suspend
Date: Fri, 6 Nov 2015 07:33:58 -0800 [thread overview]
Message-ID: <000901d118a8$8fda8450$af8f8cf0$@net> (raw)
In-Reply-To: <36DF59CE26D8EE47B0655C516E9CE64028662059@shsmsx102.ccr.corp.intel.com>
On 2015.11.01 08:50 Chen, Yu C wrote:
>> On 2015.10.10 19:27 Chen, Yu C wrote:
>>> On 2105.10.10 02:56 Doug Smythies wrote:
>>>
>>>>> The current version of the intel_pstate driver is incompatible with
>>>>> any use of Clock Modulation, always resulting in driving the target
>>>>> pstate to the minimum, regardless of load. The result is the
>>>>> apparent CPU frequency stuck at minimum * modulation percent.
>>>>
>>>>> The acpi-cpufreq driver works fine with Clock Modulation, resulting
>>>>> in desired frequency * modulation percent.
>>>>
>>
>>> [Yu] Why intel_pstate driver is incompatible with Clock Modulation?
>>
>> It is simply how the current control algorithm responds to the scenario.
>>
>> The problem is in intel_pstate_get_scaled_busy, here:
>>
>> /*
>> * core_busy is the ratio of actual performance to max
>> * max_pstate is the max non turbo pstate available
>> * current_pstate was the pstate that was requested during
>> * the last sample period.
>> *
>> * We normalize core_busy, which was our actual percent
>> * performance to what we requested during the last sample
>> * period. The result will be a percentage of busy at a
>> * specified pstate.
>> */
>> core_busy = cpu->sample.core_pct_busy;
>> max_pstate = int_tofp(cpu->pstate.max_pstate);
>> current_pstate = int_tofp(cpu->pstate.current_pstate);
>> core_busy = mul_fp(core_busy, div_fp(max_pstate, current_pstate));
>>
>> With Clock Modulation enabled, the actual performance percent will always
>> be less than what was asked for, basically meaning current_pstate is much
>> less than what was asked for. Thus the algorithm will drive down the target
>> pstate regardless of load.
>>
> [Yu] Do you mean, there is some problem with the normalization,and we should use
> the actual pstate rather than the theoretical current_pstate, for example,
> the pseudocode might looked like:
>
> - current_pstate = int_tofp(cpu->pstate.current_pstate);
> + current_pstate = int_tofp(cpu->pstate.current_pstat)*0.85;
I did not think of normalizing / compensating at this point.
That is a good idea.
Just for a test, I tried it and it seems to work well.
Before normalizing / compensating core_busy can be quite a small
for lesser clock modulation duty cycles, and so becomes a little
noisy afterwards.
For my test, on an otherwise unaltered kernel v4.3 I did this:
diff --git a/drivers/cpufreq/intel_pstate.c b/drivers/cpufreq/intel_pstate.c
index aa33b92..97a90e1 100644
--- a/drivers/cpufreq/intel_pstate.c
+++ b/drivers/cpufreq/intel_pstate.c
@@ -821,6 +821,7 @@ static inline int32_t intel_pstate_get_scaled_busy(struct cpudata *cpu)
int32_t core_busy, max_pstate, current_pstate, sample_ratio;
s64 duration_us;
u32 sample_time;
+ u64 clock_modulation;
/*
* core_busy is the ratio of actual performance to max
@@ -836,6 +837,17 @@ static inline int32_t intel_pstate_get_scaled_busy(struct cpudata *cpu)
core_busy = cpu->sample.core_pct_busy;
max_pstate = int_tofp(cpu->pstate.max_pstate);
current_pstate = int_tofp(cpu->pstate.current_pstate);
+
+// rdmsrl(MSR_IA32_CLOCK_MODULATION, clock_modulation);
+ rdmsrl(MSR_IA32_THERM_CONTROL, clock_modulation);
+ if(clock_modulation && 0X10) {
+ clock_modulation = clock_modulation & 0x0F;
+ if(clock_modulation == 0) clock_modulation = 8;
+ core_busy = mul_fp(core_busy, int_tofp(0x10));
+ core_busy = div_fp(core_busy, int_tofp(clock_modulation));
+ }
+
core_busy = mul_fp(core_busy, div_fp(max_pstate, current_pstate));
/*
next prev parent reply other threads:[~2015-11-06 15:34 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-08-27 3:18 [PATCH] [v4] x86, suspend: Save/restore extra MSR registers for suspend Chen Yu
2015-09-17 5:30 ` Pavel Machek
2015-10-09 9:39 ` Chen, Yu C
2015-10-09 18:55 ` Doug Smythies
2015-10-11 2:26 ` Chen, Yu C
2015-10-11 15:46 ` Doug Smythies
2015-11-01 16:49 ` Chen, Yu C
2015-11-06 15:33 ` Doug Smythies [this message]
2015-11-12 9:42 ` Chen, Yu C
2015-11-21 16:45 ` Doug Smythies
2015-11-27 3:28 ` Doug Smythies
2015-11-27 6:01 ` Yu Chen
2015-10-09 21:50 ` Rafael J. Wysocki
2015-10-11 2:43 ` Chen, Yu C
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='000901d118a8$8fda8450$af8f8cf0$@net' \
--to=dsmythies@telus.net \
--cc=bp@alien8.de \
--cc=hpa@zytor.com \
--cc=kristen@linux.intel.com \
--cc=len.brown@intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=mingo@kernel.org \
--cc=pavel@ucw.cz \
--cc=rafael.j.wysocki@intel.com \
--cc=rui.zhang@intel.com \
--cc=srinivas.pandruvada@intel.com \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
--cc=yu.c.chen@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).