From: Jonathan Cameron <Jonathan.Cameron@Huawei.com>
To: Dave Jiang <dave.jiang@intel.com>
Cc: <linux-cxl@vger.kernel.org>, <dan.j.williams@intel.com>,
<ira.weiny@intel.com>, <vishal.l.verma@intel.com>,
<alison.schofield@intel.com>, <dave@stgolabs.net>,
<brice.goglin@gmail.com>, <nifan.cxl@gmail.com>
Subject: Re: [PATCH v2 1/3] cxl/region: Calculate performance data for a region
Date: Tue, 19 Dec 2023 14:51:35 +0000 [thread overview]
Message-ID: <20231219145135.000021f6@Huawei.com> (raw)
In-Reply-To: <170268215975.1381493.16321994239389305102.stgit@djiang5-mobl3>
On Fri, 15 Dec 2023 16:15:59 -0700
Dave Jiang <dave.jiang@intel.com> wrote:
> Calculate and store the performance data for a CXL region. Find the worst
> read and write latency for all the included ranges from each of the devices
> that attributes to the region and designate that as the latency data. Sum
> all the read and write bandwidth data for each of the device region and
> that is the total bandwidth for the region.
>
> The perf list is expected to be constructed before the endpoint decoders
> are registered and thus there should be no early reading of the entries
> from the region assemble action. The calling of the region qos calculate
> function is under the protection of cxl_dpa_rwsem and will ensure that
> all DPA associated work has completed.
>
> Signed-off-by: Dave Jiang <dave.jiang@intel.com>
Trivial comments inline. With the HMAT reference tweaked,
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> ---
> v2:
> - Move cxled declaration (Fan)
> - Move calculate function to core/cdat.c
> - Make cxlr->coord a struct instead of allocated (Dan)
> - Remove list_empty() check (Dan)
> - Move calculation to cxl_region_attach() under cxl_dpa_rwsem (Dan)
> - Normalize perf numbers to HMAT coords (Brice, Dan)
> ---
> drivers/cxl/core/cdat.c | 53 +++++++++++++++++++++++++++++++++++++++++++++
> drivers/cxl/core/region.c | 2 ++
> drivers/cxl/cxl.h | 5 ++++
> 3 files changed, 60 insertions(+)
>
> diff --git a/drivers/cxl/core/cdat.c b/drivers/cxl/core/cdat.c
> index 5fe57fe5e2ee..29bba04306e9 100644
> --- a/drivers/cxl/core/cdat.c
> +++ b/drivers/cxl/core/cdat.c
> @@ -547,3 +547,56 @@ void cxl_switch_parse_cdat(struct cxl_port *port)
> EXPORT_SYMBOL_NS_GPL(cxl_switch_parse_cdat, CXL);
>
> MODULE_IMPORT_NS(CXL);
> +
> +void cxl_region_perf_data_calculate(struct cxl_region *cxlr,
> + struct cxl_endpoint_decoder *cxled)
> +{
> + struct list_head *perf_list;
> + struct cxl_memdev *cxlmd = cxled_to_memdev(cxled);
> + struct cxl_dev_state *cxlds = cxlmd->cxlds;
> + struct cxl_memdev_state *mds = to_cxl_memdev_state(cxlds);
> + struct range dpa = {
> + .start = cxled->dpa_res->start,
> + .end = cxled->dpa_res->end,
> + };
> + struct cxl_dpa_perf *perf;
> + bool found = false;
> +
> + switch (cxlr->mode) {
> + case CXL_DECODER_RAM:
> + perf_list = &mds->ram_perf_list;
> + break;
> + case CXL_DECODER_PMEM:
> + perf_list = &mds->pmem_perf_list;
> + break;
> + default:
> + return;
> + }
> +
> + list_for_each_entry(perf, perf_list, list) {
> + if (range_contains(&perf->dpa_range, &dpa)) {
> + found = true;
> + break;
> + }
> + }
> +
> + if (!found)
> + return;
Could use
if (list_entry_is_head())
return;
and drop the found variable. Though that is a little bit specific to the
internals of the list infrastructure so maybe adding a variable is better..
There is precedence for both approaches in tree.
> +
> + /* Get total bandwidth and the worst latency for the cxl region */
> + cxlr->coord.read_latency = max_t(unsigned int,
> + cxlr->coord.read_latency,
> + perf->coord.read_latency);
> + cxlr->coord.write_latency = max_t(unsigned int,
> + cxlr->coord.write_latency,
> + perf->coord.write_latency);
> + cxlr->coord.read_bandwidth += perf->coord.read_bandwidth;
> + cxlr->coord.write_bandwidth += perf->coord.write_bandwidth;
> +
> + /*
> + * Convert latency to nanosec from picosec to be consistent with HMAT
HMAT version what? You may ask why is there a breaking change in the HMAT definition
between 6.2 and 6.3 but I'd rather you didn't :(
> + * attributes.
> + */
> + cxlr->coord.read_latency = DIV_ROUND_UP(cxlr->coord.read_latency, 1000);
> + cxlr->coord.write_latency = DIV_ROUND_UP(cxlr->coord.write_latency, 1000);
> +}
next prev parent reply other threads:[~2023-12-19 14:52 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-12-15 23:15 [PATCH v2 0/3] cxl: Add support to report region access coordinates to numa nodes Dave Jiang
2023-12-15 23:15 ` [PATCH v2 1/3] cxl/region: Calculate performance data for a region Dave Jiang
2023-12-19 14:51 ` Jonathan Cameron [this message]
2023-12-21 22:51 ` Dave Jiang
2024-01-08 13:58 ` Jonathan Cameron
2023-12-15 23:16 ` [PATCH v2 2/3] cxl/region: Add sysfs attribute for locality attributes of CXL regions Dave Jiang
2023-12-19 14:58 ` Jonathan Cameron
2023-12-15 23:16 ` [PATCH v2 3/3] cxl: Add memory hotplug notifier for cxl region Dave Jiang
2023-12-19 15:15 ` Jonathan Cameron
2023-12-22 18:17 ` Dave Jiang
2024-01-08 13:56 ` Jonathan Cameron
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20231219145135.000021f6@Huawei.com \
--to=jonathan.cameron@huawei.com \
--cc=alison.schofield@intel.com \
--cc=brice.goglin@gmail.com \
--cc=dan.j.williams@intel.com \
--cc=dave.jiang@intel.com \
--cc=dave@stgolabs.net \
--cc=ira.weiny@intel.com \
--cc=linux-cxl@vger.kernel.org \
--cc=nifan.cxl@gmail.com \
--cc=vishal.l.verma@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox