From: Quentin Perret <qperret@google.com>
To: lukasz.luba@arm.com
Cc: nm@ti.com, juri.lelli@redhat.com, daniel.lezcano@linaro.org,
peterz@infradead.org, viresh.kumar@linaro.org,
liviu.dudau@arm.com, dri-devel@lists.freedesktop.org,
bjorn.andersson@linaro.org, bsegall@google.com,
alyssa.rosenzweig@collabora.com, festevam@gmail.com,
Morten.Rasmussen@arm.com, robh@kernel.org,
amit.kucheria@verdurent.com, lorenzo.pieralisi@arm.com,
khilman@kernel.org, agross@kernel.org, b.zolnierkie@samsung.com,
steven.price@arm.com, cw00.choi@samsung.com, mingo@redhat.com,
linux-imx@nxp.com, rui.zhang@intel.com, mgorman@suse.de,
daniel@ffwll.ch, linux-pm@vger.kernel.org,
linux-arm-msm@vger.kernel.org, s.hauer@pengutronix.de,
rostedt@goodmis.org, linux-mediatek@lists.infradead.org,
matthias.bgg@gmail.com, Chris.Redpath@arm.com,
linux-omap@vger.kernel.org, Dietmar.Eggemann@arm.com,
linux-arm-kernel@lists.infradead.org, airlied@linux.ie,
javi.merino@arm.com, tomeu.vizoso@collabora.com,
sboyd@kernel.org, shawnguo@kernel.org, rjw@rjwysocki.net,
linux-kernel@vger.kernel.org, kernel@pengutronix.de,
sudeep.holla@arm.com, patrick.bellasi@matbug.net,
ionela.voinescu@arm.com
Subject: Re: [PATCH v2 1/4] PM / EM: add devices to Energy Model
Date: Fri, 7 Feb 2020 12:04:30 +0000 [thread overview]
Message-ID: <20200207120430.GA242912@google.com> (raw)
In-Reply-To: <20200206134640.11367-2-lukasz.luba@arm.com>
On Thursday 06 Feb 2020 at 13:46:37 (+0000), lukasz.luba@arm.com wrote:
> 2. Core APIs
> @@ -70,14 +72,16 @@ CONFIG_ENERGY_MODEL must be enabled to use the EM framework.
> Drivers are expected to register performance domains into the EM framework by
> calling the following API::
>
> - int em_register_perf_domain(cpumask_t *span, unsigned int nr_states,
> - struct em_data_callback *cb);
> + int em_register_perf_domain(struct device *dev, unsigned int nr_states,
> + struct em_data_callback *cb, cpumask_t *cpus);
>
> -Drivers must specify the CPUs of the performance domains using the cpumask
> +Drivers must specify the device pointer of the performance domains as first
I find this sentence a little odd no?
> argument, and provide a callback function returning <frequency, power> tuples
> -for each capacity state. The callback function provided by the driver is free
> +for each performance state. The callback function provided by the driver is free
> to fetch data from any relevant location (DT, firmware, ...), and by any mean
> -deemed necessary. See Section 3. for an example of driver implementing this
> +deemed necessary. For other devices than CPUs the last argumant must be set to
s/argumant/argument
> +NULL. Only for CPUfreq drivers it is obligatory to specify the cpumask.
Please note that as of today nothing mandates the caller to be a CPUFreq
driver -- it could be anything in theory. I'd say 'only for CPU devices'
instead.
<snip>
> @@ -24,51 +27,65 @@ struct em_cap_state {
>
> /**
> * em_perf_domain - Performance domain
> - * @table: List of capacity states, in ascending order
> - * @nr_cap_states: Number of capacity states
> - * @cpus: Cpumask covering the CPUs of the domain
> + * @table: List of performance states, in ascending order
> + * @nr_perf_states: Number of performance states
> + * @priv: In case of EM for CPU device it is a Cpumask
> + * covering the CPUs of the domain
Could you turn @priv back into 'unsigned long priv[0];' and keep the
allocation as it is today ? That is, append the cpumask to the struct.
This empty pointer for non-CPU devices is just wasted space, and pointer
chasing isn't good for your caches. Given that you pre-allocate the pd
in em_create_pd() you could just have a special case for CPUs there I
suppose. And _is_cpu_em() will have to check the bus like you did in v1.
> *
> - * A "performance domain" represents a group of CPUs whose performance is
> - * scaled together. All CPUs of a performance domain must have the same
> - * micro-architecture. Performance domains often have a 1-to-1 mapping with
> - * CPUFreq policies.
> + * In case of CPU device, a "performance domain" represents a group of CPUs
> + * whose performance is scaled together. All CPUs of a performance domain
> + * must have the same micro-architecture. Performance domains often have
> + * a 1-to-1 mapping with CPUFreq policies.
> + * In case of other devices the 'priv' field is unused.
> */
> struct em_perf_domain {
> - struct em_cap_state *table;
> - int nr_cap_states;
> - unsigned long cpus[0];
> + struct em_perf_state *table;
> + int nr_perf_states;
> + void *priv;
> };
<snip>
> struct em_data_callback {
> /**
> - * active_power() - Provide power at the next capacity state of a CPU
> - * @power : Active power at the capacity state in mW (modified)
> - * @freq : Frequency at the capacity state in kHz (modified)
> - * @cpu : CPU for which we do this operation
> + * active_power() - Provide power at the next performance state of a
> + * device
> + * @power : Active power at the performance state in mW (modified)
> + * @freq : Frequency at the performance state in kHz (modified)
> + * @dev : Device for which we do this operation (can be a CPU)
> *
> - * active_power() must find the lowest capacity state of 'cpu' above
> + * active_power() must find the lowest performance state of 'dev' above
> * 'freq' and update 'power' and 'freq' to the matching active power
> * and frequency.
> *
> - * The power is the one of a single CPU in the domain, expressed in
> - * milli-watts. It is expected to fit in the [0, EM_CPU_MAX_POWER]
> - * range.
> + * In case of CPUs, the power is the one of a single CPU in the domain,
> + * expressed in milli-watts. It is expected to fit in the
> + * [0, EM_MAX_POWER] range.
> *
> * Return 0 on success.
> */
> - int (*active_power)(unsigned long *power, unsigned long *freq, int cpu);
> + int (*active_power)(unsigned long *power, unsigned long *freq,
> + struct device *dev);
Given that you've made explicit in the doc of struct em_perf_state that
'power' can be a 'total' value (static + dynamic), this could be renamed
I suppose.
<snip>
> /**
> * em_cpu_get() - Return the performance domain for a CPU
> * @cpu : CPU to find the performance domain for
> *
> - * Return: the performance domain to which 'cpu' belongs, or NULL if it doesn't
> + * Returns the performance domain to which 'cpu' belongs, or NULL if it doesn't
> * exist.
> */
> struct em_perf_domain *em_cpu_get(int cpu)
> {
> - return READ_ONCE(per_cpu(em_data, cpu));
Since CPU perf domains are guaranteed to never go away, it'd be safe to
keep that per-CPU variable and avoid the locking and list manipulation
below. No strong opinion, though.
> + struct em_device *em_dev;
> +
> + mutex_lock(&em_pd_mutex);
> +
> + if (list_empty(&em_pd_dev_list))
> + goto unlock;
> +
> + list_for_each_entry(em_dev, &em_pd_dev_list, em_dev_list) {
> + if (!_is_cpu_em(em_dev->em_pd))
> + continue;
> +
> + if (cpumask_test_cpu(cpu, em_span_cpus(em_dev->em_pd))) {
> + mutex_unlock(&em_pd_mutex);
> + return em_dev->em_pd;
> + }
> + }
> +
> +unlock:
> + mutex_unlock(&em_pd_mutex);
> + return NULL;
> }
> EXPORT_SYMBOL_GPL(em_cpu_get);
<snip>
> /**
> - * em_register_perf_domain() - Register the Energy Model of a performance domain
> - * @span : Mask of CPUs in the performance domain
> - * @nr_states : Number of capacity states to register
> + * em_register_perf_domain() - Register the Energy Model (EM) of a performance
> + * domain for the device
> + * @dev : Device for which the EM is to register
> + * @nr_states : Number of performance states to register
> * @cb : Callback functions providing the data of the Energy Model
> + * @cpus : Pointer to cpumask_t, which in case of a CPU device is
> + * obligatory. It can be taken from i.e. 'policy->cpus'. For other
It should be policy->related_cpus actually (or 'real_cpus' even) -- PM_EM
ignores hotplug ATM. Perhaps we should document that somewhere ...
> + * type of devices this should be set to NULL.
> *
> * Create Energy Model tables for a performance domain using the callbacks
> * defined in cb.
> @@ -196,63 +361,129 @@ EXPORT_SYMBOL_GPL(em_cpu_get);
Thanks,
Quentin
_______________________________________________
linux-arm-kernel mailing list
linux-arm-kernel@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/linux-arm-kernel
next prev parent reply other threads:[~2020-02-07 12:04 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-02-06 13:46 [PATCH v2 0/4] Add support for devices in the Energy Model lukasz.luba
2020-02-06 13:46 ` [PATCH v2 1/4] PM / EM: add devices to " lukasz.luba
2020-02-07 12:04 ` Quentin Perret [this message]
2020-02-07 15:59 ` Lukasz Luba
2020-02-13 10:59 ` Dietmar Eggemann
2020-02-13 15:25 ` Lukasz Luba
2020-02-06 13:46 ` [PATCH v2 2/4] OPP: change parameter to device pointer in dev_pm_opp_of_register_em() lukasz.luba
2020-02-06 13:46 ` [PATCH v2 3/4] thermal: devfreq_cooling: Refactor code and switch to use Energy Model lukasz.luba
2020-02-06 13:46 ` [PATCH v2 4/4] drm/panfrost: Register to the Energy Model with devfreq device lukasz.luba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20200207120430.GA242912@google.com \
--to=qperret@google.com \
--cc=Chris.Redpath@arm.com \
--cc=Dietmar.Eggemann@arm.com \
--cc=Morten.Rasmussen@arm.com \
--cc=agross@kernel.org \
--cc=airlied@linux.ie \
--cc=alyssa.rosenzweig@collabora.com \
--cc=amit.kucheria@verdurent.com \
--cc=b.zolnierkie@samsung.com \
--cc=bjorn.andersson@linaro.org \
--cc=bsegall@google.com \
--cc=cw00.choi@samsung.com \
--cc=daniel.lezcano@linaro.org \
--cc=daniel@ffwll.ch \
--cc=dri-devel@lists.freedesktop.org \
--cc=festevam@gmail.com \
--cc=ionela.voinescu@arm.com \
--cc=javi.merino@arm.com \
--cc=juri.lelli@redhat.com \
--cc=kernel@pengutronix.de \
--cc=khilman@kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-arm-msm@vger.kernel.org \
--cc=linux-imx@nxp.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mediatek@lists.infradead.org \
--cc=linux-omap@vger.kernel.org \
--cc=linux-pm@vger.kernel.org \
--cc=liviu.dudau@arm.com \
--cc=lorenzo.pieralisi@arm.com \
--cc=lukasz.luba@arm.com \
--cc=matthias.bgg@gmail.com \
--cc=mgorman@suse.de \
--cc=mingo@redhat.com \
--cc=nm@ti.com \
--cc=patrick.bellasi@matbug.net \
--cc=peterz@infradead.org \
--cc=rjw@rjwysocki.net \
--cc=robh@kernel.org \
--cc=rostedt@goodmis.org \
--cc=rui.zhang@intel.com \
--cc=s.hauer@pengutronix.de \
--cc=sboyd@kernel.org \
--cc=shawnguo@kernel.org \
--cc=steven.price@arm.com \
--cc=sudeep.holla@arm.com \
--cc=tomeu.vizoso@collabora.com \
--cc=viresh.kumar@linaro.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).