From: Beata Michalska <beata.michalska@arm.com>
To: Lifeng Zheng <zhenglifeng1@huawei.com>
Cc: catalin.marinas@arm.com, will@kernel.org, rafael@kernel.org,
viresh.kumar@linaro.org, sudeep.holla@arm.com,
linux-arm-kernel@lists.infradead.org, linux-pm@vger.kernel.org,
linux-kernel@vger.kernel.org, linuxarm@huawei.com,
jonathan.cameron@huawei.com, vincent.guittot@linaro.org,
yangyicong@hisilicon.com, zhanjie9@hisilicon.com,
lihuisong@huawei.com, yubowen8@huawei.com,
zhangpengjie2@huawei.com, linhongye@h-partners.com
Subject: Re: [PATCH v5 3/3] arm64: topology: Setup AMU FIE for online CPUs only
Date: Wed, 20 Aug 2025 11:21:23 +0200 [thread overview]
Message-ID: <aKWTk5dfZysd_8ks@arm.com> (raw)
In-Reply-To: <20250819072931.1647431-4-zhenglifeng1@huawei.com>
The commit title does read a bit wrong I think.
It does not really reflect the change for supporting cpu hotplug.
Maybe smth along the lines of:
arm64: topology: Handle AMU FIE setup on CPU hotplug
?
On Tue, Aug 19, 2025 at 03:29:31PM +0800, Lifeng Zheng wrote:
> When boot with maxcpu=1 restrict, and LPI(Low Power Idle States) is on,
So actually it is `maxcpus` to start with, Not sure what LPI has to do with any
of that ?
It might be better to slightly reword the whole commit message.
The problem is with CPUs being offline at the time the cpufreq policy is being
created so it might be with maxcpus != nr_cpus , or because cpu bring-up
failed, or due to RAS event that caused the cpu to go offline, etc.
How about:
"When a cpufreq policy is created, AMU FIE setup currently verifies all
CPUs in the policy, regardless of whether they are online. If any of
those CPUs are offline, their AMU capability flag is not yet verified, and
the check fails. As a result, AMU FIE is not enabled even if the CPUs
that are online do support it.
Later, when the offline CPUs eventually come online and advertise AMU
support, they have no opportunity to re-enable AMU FIE for the policy,
leaving the whole frequency domain without AMU FIE despite being
eligible.
Restrict the initial AMU FIE check to the CPUs that are online at the
time the policy is created, and allow CPUs brought online later to join
the policy with AMU FIE enabled."
> only CPU0 will go online. The support AMU flag of CPU0 will be set but the
> flags of other CPUs will not. This will cause AMU FIE set up fail for CPU0
> when it shares a cpufreq policy with other CPU(s). After that, when other
> CPUs are finally online and the support AMU flags of them are set, they'll
> never have a chance to set up AMU FIE, even though they're eligible.
>
> To solve this problem, the process of setting up AMU FIE needs to be
> modified as follows:
>
> 1. Set up AMU FIE only for the online CPUs.
>
> 2. Try to set up AMU FIE each time a CPU goes online and do the
> freq_counters_valid() check. If this check fails, clear scale freq source
> of all the CPUs related to the same policy, in case they use different
> source of the freq scale.
>
> At the same time, this change also be applied to cpufreq when calling
> arch_set_freq_scale.
Could we clarify that a bit ? Reads a bit ambiguous.
>
> Signed-off-by: Lifeng Zheng <zhenglifeng1@huawei.com>
> ---
> arch/arm64/kernel/topology.c | 54 ++++++++++++++++++++++++++++++++++--
> drivers/cpufreq/cpufreq.c | 4 +--
> 2 files changed, 54 insertions(+), 4 deletions(-)
>
> diff --git a/arch/arm64/kernel/topology.c b/arch/arm64/kernel/topology.c
> index 9317a618bb87..a9d9e9969cea 100644
> --- a/arch/arm64/kernel/topology.c
> +++ b/arch/arm64/kernel/topology.c
> @@ -385,7 +385,7 @@ static int init_amu_fie_callback(struct notifier_block *nb, unsigned long val,
> struct cpufreq_policy *policy = data;
>
> if (val == CPUFREQ_CREATE_POLICY)
> - amu_fie_setup(policy->related_cpus);
> + amu_fie_setup(policy->cpus);
>
> /*
> * We don't need to handle CPUFREQ_REMOVE_POLICY event as the AMU
> @@ -404,10 +404,60 @@ static struct notifier_block init_amu_fie_notifier = {
> .notifier_call = init_amu_fie_callback,
> };
>
> +static int cpuhp_topology_online(unsigned int cpu)
> +{
> + struct cpufreq_policy *policy = cpufreq_cpu_policy(cpu);
> +
> + /*
> + * If the online CPUs are not all AMU FIE CPUs or the new one is already
> + * an AMU FIE one, no need to set it.
> + */
> + if (!policy || !cpumask_available(amu_fie_cpus) ||
> + !cpumask_subset(policy->cpus, amu_fie_cpus) ||
> + cpumask_test_cpu(cpu, amu_fie_cpus))
> + return 0;
I believe this can be slightly optimised and made more ... readable, i.e:
diff --git a/arch/arm64/kernel/topology.c b/arch/arm64/kernel/topology.c
index a9d9e9969cea..2d6ce34af8e4 100644
--- a/arch/arm64/kernel/topology.c
+++ b/arch/arm64/kernel/topology.c
@@ -408,15 +408,25 @@ static int cpuhp_topology_online(unsigned int cpu)
{
struct cpufreq_policy *policy = cpufreq_cpu_policy(cpu);
+ /* Those are cheap checks */
+
/*
- * If the online CPUs are not all AMU FIE CPUs or the new one is already
- * an AMU FIE one, no need to set it.
+ * Skip this CPU if:
+ * - it has no cpufreq policy assigned yet,
+ * - no policy exists that spans CPUs with AMU counters, or
+ * - it was already handled.
*/
- if (!policy || !cpumask_available(amu_fie_cpus) ||
- !cpumask_subset(policy->cpus, amu_fie_cpus) ||
+ if (unlikely(!policy || !cpumask_available(amu_fie_cpus)) ||
cpumask_test_cpu(cpu, amu_fie_cpus))
return 0;
+ /*
+ * Only proceed if all already-online CPUs in this policy
+ * support AMU counters.
+ */
+ if (unlikely(!cpumask_subset(policy->cpus, amu_fie_cpus)))
+ return 0;
+
/*
* If the new online CPU cannot pass this check, all the CPUs related to
* the same policy should be clear from amu_fie_cpus mask, otherwise they
---
BR
Beata
> +
> + /*
> + * If the new online CPU cannot pass this check, all the CPUs related to
> + * the same policy should be clear from amu_fie_cpus mask, otherwise they
> + * may use different source of the freq scale.
> + */
> + if (WARN_ON(!freq_counters_valid(cpu))) {
> + topology_clear_scale_freq_source(SCALE_FREQ_SOURCE_ARCH,
> + policy->related_cpus);
> + cpumask_andnot(amu_fie_cpus, amu_fie_cpus, policy->related_cpus);
> + return 0;
> + }
> +
> + cpumask_set_cpu(cpu, amu_fie_cpus);
> +
> + topology_set_scale_freq_source(&amu_sfd, cpumask_of(cpu));
> +
> + pr_debug("CPU[%u]: counter will be used for FIE.", cpu);
> +
> + return 0;
> +}
> +
> static int __init init_amu_fie(void)
> {
> - return cpufreq_register_notifier(&init_amu_fie_notifier,
> + int ret;
> +
> + ret = cpufreq_register_notifier(&init_amu_fie_notifier,
> CPUFREQ_POLICY_NOTIFIER);
> + if (ret)
> + return ret;
> +
> + ret = cpuhp_setup_state_nocalls(CPUHP_AP_ONLINE_DYN,
> + "arm64/topology:online",
> + cpuhp_topology_online,
> + NULL);
> + if (ret < 0) {
> + cpufreq_unregister_notifier(&init_amu_fie_notifier,
> + CPUFREQ_POLICY_NOTIFIER);
> + return ret;
> + }
> +
> + return 0;
> }
> core_initcall(init_amu_fie);
>
> diff --git a/drivers/cpufreq/cpufreq.c b/drivers/cpufreq/cpufreq.c
> index 78ca68ea754d..d1890a2af1af 100644
> --- a/drivers/cpufreq/cpufreq.c
> +++ b/drivers/cpufreq/cpufreq.c
> @@ -417,7 +417,7 @@ void cpufreq_freq_transition_end(struct cpufreq_policy *policy,
>
> cpufreq_notify_post_transition(policy, freqs, transition_failed);
>
> - arch_set_freq_scale(policy->related_cpus,
> + arch_set_freq_scale(policy->cpus,
> policy->cur,
> arch_scale_freq_ref(policy->cpu));
>
> @@ -2219,7 +2219,7 @@ unsigned int cpufreq_driver_fast_switch(struct cpufreq_policy *policy,
> return 0;
>
> policy->cur = freq;
> - arch_set_freq_scale(policy->related_cpus, freq,
> + arch_set_freq_scale(policy->cpus, freq,
> arch_scale_freq_ref(policy->cpu));
> cpufreq_stats_record_transition(policy, freq);
>
> --
> 2.33.0
>
next prev parent reply other threads:[~2025-08-20 9:21 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-08-19 7:29 [PATCH v5 0/3] arm64: topology: Setup AMU FIE for online CPUs only Lifeng Zheng
2025-08-19 7:29 ` [PATCH v5 1/3] arm64: topology: Set scale freq source only for the CPUs that have not been set before Lifeng Zheng
2025-08-20 9:22 ` Beata Michalska
2025-08-30 4:08 ` Jie Zhan
2025-08-19 7:29 ` [PATCH v5 2/3] cpufreq: Add a new function to get cpufreq policy without checking if the CPU is online Lifeng Zheng
2025-08-19 19:05 ` Rafael J. Wysocki
2025-08-20 1:50 ` zhenglifeng (A)
2025-08-30 4:11 ` Jie Zhan
2025-08-19 7:29 ` [PATCH v5 3/3] arm64: topology: Setup AMU FIE for online CPUs only Lifeng Zheng
2025-08-20 9:21 ` Beata Michalska [this message]
2025-08-30 10:20 ` Jie Zhan
2025-09-01 7:29 ` Ionela Voinescu
2025-09-05 7:31 ` Beata Michalska
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aKWTk5dfZysd_8ks@arm.com \
--to=beata.michalska@arm.com \
--cc=catalin.marinas@arm.com \
--cc=jonathan.cameron@huawei.com \
--cc=lihuisong@huawei.com \
--cc=linhongye@h-partners.com \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=linuxarm@huawei.com \
--cc=rafael@kernel.org \
--cc=sudeep.holla@arm.com \
--cc=vincent.guittot@linaro.org \
--cc=viresh.kumar@linaro.org \
--cc=will@kernel.org \
--cc=yangyicong@hisilicon.com \
--cc=yubowen8@huawei.com \
--cc=zhangpengjie2@huawei.com \
--cc=zhanjie9@hisilicon.com \
--cc=zhenglifeng1@huawei.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).