From: Jonathan Cameron <Jonathan.Cameron@Huawei.com>
To: Dave Jiang <dave.jiang@intel.com>
Cc: <linux-cxl@vger.kernel.org>, <dan.j.williams@intel.com>,
<ira.weiny@intel.com>, <vishal.l.verma@intel.com>,
<alison.schofield@intel.com>, <dave@stgolabs.net>,
<brice.goglin@gmail.com>, <nifan.cxl@gmail.com>
Subject: Re: [PATCH v2 1/3] cxl/region: Calculate performance data for a region
Date: Tue, 19 Dec 2023 14:51:35 +0000 [thread overview]
Message-ID: <20231219145135.000021f6@Huawei.com> (raw)
In-Reply-To: <170268215975.1381493.16321994239389305102.stgit@djiang5-mobl3>
On Fri, 15 Dec 2023 16:15:59 -0700
Dave Jiang <dave.jiang@intel.com> wrote:
> Calculate and store the performance data for a CXL region. Find the worst
> read and write latency for all the included ranges from each of the devices
> that attributes to the region and designate that as the latency data. Sum
> all the read and write bandwidth data for each of the device region and
> that is the total bandwidth for the region.
>
> The perf list is expected to be constructed before the endpoint decoders
> are registered and thus there should be no early reading of the entries
> from the region assemble action. The calling of the region qos calculate
> function is under the protection of cxl_dpa_rwsem and will ensure that
> all DPA associated work has completed.
>
> Signed-off-by: Dave Jiang <dave.jiang@intel.com>
Trivial comments inline. With the HMAT reference tweaked,
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> ---
> v2:
> - Move cxled declaration (Fan)
> - Move calculate function to core/cdat.c
> - Make cxlr->coord a struct instead of allocated (Dan)
> - Remove list_empty() check (Dan)
> - Move calculation to cxl_region_attach() under cxl_dpa_rwsem (Dan)
> - Normalize perf numbers to HMAT coords (Brice, Dan)
> ---
> drivers/cxl/core/cdat.c | 53 +++++++++++++++++++++++++++++++++++++++++++++
> drivers/cxl/core/region.c | 2 ++
> drivers/cxl/cxl.h | 5 ++++
> 3 files changed, 60 insertions(+)
>
> diff --git a/drivers/cxl/core/cdat.c b/drivers/cxl/core/cdat.c
> index 5fe57fe5e2ee..29bba04306e9 100644
> --- a/drivers/cxl/core/cdat.c
> +++ b/drivers/cxl/core/cdat.c
> @@ -547,3 +547,56 @@ void cxl_switch_parse_cdat(struct cxl_port *port)
> EXPORT_SYMBOL_NS_GPL(cxl_switch_parse_cdat, CXL);
>
> MODULE_IMPORT_NS(CXL);
> +
> +void cxl_region_perf_data_calculate(struct cxl_region *cxlr,
> + struct cxl_endpoint_decoder *cxled)
> +{
> + struct list_head *perf_list;
> + struct cxl_memdev *cxlmd = cxled_to_memdev(cxled);
> + struct cxl_dev_state *cxlds = cxlmd->cxlds;
> + struct cxl_memdev_state *mds = to_cxl_memdev_state(cxlds);
> + struct range dpa = {
> + .start = cxled->dpa_res->start,
> + .end = cxled->dpa_res->end,
> + };
> + struct cxl_dpa_perf *perf;
> + bool found = false;
> +
> + switch (cxlr->mode) {
> + case CXL_DECODER_RAM:
> + perf_list = &mds->ram_perf_list;
> + break;
> + case CXL_DECODER_PMEM:
> + perf_list = &mds->pmem_perf_list;
> + break;
> + default:
> + return;
> + }
> +
> + list_for_each_entry(perf, perf_list, list) {
> + if (range_contains(&perf->dpa_range, &dpa)) {
> + found = true;
> + break;
> + }
> + }
> +
> + if (!found)
> + return;
Could use
if (list_entry_is_head())
return;
and drop the found variable. Though that is a little bit specific to the
internals of the list infrastructure so maybe adding a variable is better..
There is precedence for both approaches in tree.
> +
> + /* Get total bandwidth and the worst latency for the cxl region */
> + cxlr->coord.read_latency = max_t(unsigned int,
> + cxlr->coord.read_latency,
> + perf->coord.read_latency);
> + cxlr->coord.write_latency = max_t(unsigned int,
> + cxlr->coord.write_latency,
> + perf->coord.write_latency);
> + cxlr->coord.read_bandwidth += perf->coord.read_bandwidth;
> + cxlr->coord.write_bandwidth += perf->coord.write_bandwidth;
> +
> + /*
> + * Convert latency to nanosec from picosec to be consistent with HMAT
HMAT version what? You may ask why is there a breaking change in the HMAT definition
between 6.2 and 6.3 but I'd rather you didn't :(
> + * attributes.
> + */
> + cxlr->coord.read_latency = DIV_ROUND_UP(cxlr->coord.read_latency, 1000);
> + cxlr->coord.write_latency = DIV_ROUND_UP(cxlr->coord.write_latency, 1000);
> +}
next prev parent reply other threads:[~2023-12-19 14:52 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-12-15 23:15 [PATCH v2 0/3] cxl: Add support to report region access coordinates to numa nodes Dave Jiang
2023-12-15 23:15 ` [PATCH v2 1/3] cxl/region: Calculate performance data for a region Dave Jiang
2023-12-19 14:51 ` Jonathan Cameron [this message]
2023-12-21 22:51 ` Dave Jiang
2024-01-08 13:58 ` Jonathan Cameron
2023-12-15 23:16 ` [PATCH v2 2/3] cxl/region: Add sysfs attribute for locality attributes of CXL regions Dave Jiang
2023-12-19 14:58 ` Jonathan Cameron
2023-12-15 23:16 ` [PATCH v2 3/3] cxl: Add memory hotplug notifier for cxl region Dave Jiang
2023-12-19 15:15 ` Jonathan Cameron
2023-12-22 18:17 ` Dave Jiang
2024-01-08 13:56 ` Jonathan Cameron
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20231219145135.000021f6@Huawei.com \
--to=jonathan.cameron@huawei.com \
--cc=alison.schofield@intel.com \
--cc=brice.goglin@gmail.com \
--cc=dan.j.williams@intel.com \
--cc=dave.jiang@intel.com \
--cc=dave@stgolabs.net \
--cc=ira.weiny@intel.com \
--cc=linux-cxl@vger.kernel.org \
--cc=nifan.cxl@gmail.com \
--cc=vishal.l.verma@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.