From: Jonathan Cameron <Jonathan.Cameron@Huawei.com>
To: Dave Jiang <dave.jiang@intel.com>
Cc: <linux-cxl@vger.kernel.org>, <dan.j.williams@intel.com>,
<ira.weiny@intel.com>, <vishal.l.verma@intel.com>,
<alison.schofield@intel.com>, <dave@stgolabs.net>
Subject: Re: [PATCH v3 3/3] cxl: Add checks to access_coordinate calculation to fail missing data
Date: Thu, 7 Mar 2024 12:59:00 +0000 [thread overview]
Message-ID: <20240307125900.00003469@Huawei.com> (raw)
In-Reply-To: <20240306175204.1906538-3-dave.jiang@intel.com>
On Wed, 6 Mar 2024 10:52:04 -0700
Dave Jiang <dave.jiang@intel.com> wrote:
> Jonathan noted that when the coordinates for host bridge and switches
> can be 0s if no actual data are retrieved and the calculation continues.
> The resulting number would be inaccurate. Add checks to ensure that the
> calculation would complete only if the numbers are valid.
>
> While not seen in the wild, issue may show up with a BIOS that reported
> CXL root ports via Generic Ports (via a PCI handle in the SRAT entry).
>
> Fixes: 14a6960b3e92 ("cxl: Add helper function that calculate performance data for downstream ports")
> Reported-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> Signed-off-by: Dave Jiang <dave.jiang@intel.com>
Looks nice and clean now., So subject to resolution of the philosophy question
inline...
Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com>
> ---
> v3:
> - Simplify iteration loop in cxl_endpoint_get_perf_coordinates(). (Jonathan)
> ---
> drivers/cxl/core/port.c | 33 +++++++++++++++++++++++----------
> 1 file changed, 23 insertions(+), 10 deletions(-)
>
> diff --git a/drivers/cxl/core/port.c b/drivers/cxl/core/port.c
> index 6fa273677963..baaaeddae775 100644
> --- a/drivers/cxl/core/port.c
> +++ b/drivers/cxl/core/port.c
> @@ -2110,6 +2110,20 @@ static void combine_coordinates(struct access_coordinate *c1,
> c1->read_latency += c2->read_latency;
> }
>
> +static bool coordinates_invalid(struct access_coordinate *c)
> +{
> + if (!c->read_bandwidth && !c->write_bandwidth &&
> + !c->read_latency && !c->write_latency)
> + return true;
I'm not sure on logic here. I agree in theory it's possible to have
only one coord presented but do we actually want to support that?
One of those cases where perhaps Linux should insist on sanity even
if the specifications do not.
I'd demand them all and flip to coordinates_valid() perhaps?
return c->readbandwidth && c->write_bandwidth &&
c->read_latency && c->write_latency;
Maybe there is a dubious argument that a host might not provide bandwidth
to the HB if it knows it is way bigger than anything beyond that point...
Doubtful..
> +
> + return false;
> +}
> +
> +static bool parent_port_is_cxl_root(struct cxl_port *port)
> +{
> + return is_cxl_root(to_cxl_port(port->dev.parent));
> +}
> +
> /**
> * cxl_endpoint_get_perf_coordinates - Retrieve performance numbers stored in dports
> * of CXL path
> @@ -2133,23 +2147,22 @@ int cxl_endpoint_get_perf_coordinates(struct cxl_port *port,
> if (!is_cxl_endpoint(port))
> return -EINVAL;
>
> - dport = iter->parent_dport;
> -
> /*
> - * Exit the loop when the parent port of the current port is cxl root.
> - * The iterative loop starts at the endpoint and gathers the
> - * latency of the CXL link from the current iter to the next downstream
> - * port each iteration. If the parent is cxl root then there is
> - * nothing to gather.
> + * Exit the loop when the parent port of the current iter port is cxl
> + * root. The iterative loop starts at the endpoint and gathers the
> + * latency of the CXL link from the current device/port to the connected
> + * downstream port each iteration.
> */
> - while (!is_cxl_root(to_cxl_port(iter->dev.parent))) {
> + do {
> + dport = iter->parent_dport;
> + if (coordinates_invalid(&dport->coord))
> + return -EINVAL;
> combine_coordinates(&c, &dport->coord);
> c.write_latency += dport->link_latency;
> c.read_latency += dport->link_latency;
>
> iter = to_cxl_port(iter->dev.parent);
> - dport = iter->parent_dport;
> - }
> + } while (!parent_port_is_cxl_root(iter));
>
> /* Get the calculated PCI paths bandwidth */
> pdev = to_pci_dev(port->uport_dev->parent);
prev parent reply other threads:[~2024-03-07 12:59 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-03-06 17:52 [PATCH v3 1/3] cxl: Remove checking of iter in cxl_endpoint_get_perf_coordinates() Dave Jiang
2024-03-06 17:52 ` [PATCH v3 2/3] cxl: Consolidate dport access_coordinate ->hb_coord and ->sw_coord into ->coord Dave Jiang
2024-03-07 12:53 ` Jonathan Cameron
2024-03-06 17:52 ` [PATCH v3 3/3] cxl: Add checks to access_coordinate calculation to fail missing data Dave Jiang
2024-03-07 12:59 ` Jonathan Cameron [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20240307125900.00003469@Huawei.com \
--to=jonathan.cameron@huawei.com \
--cc=alison.schofield@intel.com \
--cc=dan.j.williams@intel.com \
--cc=dave.jiang@intel.com \
--cc=dave@stgolabs.net \
--cc=ira.weiny@intel.com \
--cc=linux-cxl@vger.kernel.org \
--cc=vishal.l.verma@intel.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox