From: Cristian Marussi <cristian.marussi@arm.com>
To: Luke Parkin <luke.parkin@arm.com>
Cc: linux-kernel@vger.kernel.org,
linux-arm-kernel@lists.infradead.org, arm-scmi@vger.kernel.org,
sudeep.holla@arm.com, cristian.marussi@arm.com
Subject: Re: [PATCH v2 3/4] firmware: arm_scmi: Track basic SCMI statistics
Date: Mon, 8 Jul 2024 15:00:31 +0100 [thread overview]
Message-ID: <Zovw_4MgumQo8npi@pluto> (raw)
In-Reply-To: <20240703143738.2007457-4-luke.parkin@arm.com>
On Wed, Jul 03, 2024 at 03:37:37PM +0100, Luke Parkin wrote:
> Add tracking of 5 initial statistics
>
> Signed-off-by: Luke Parkin <luke.parkin@arm.com>
Missing
---
> V1->V2
> Drop unneccesary atomic_set's
> Use new 'scmi_log_stats' to simplify incrementing of atomics
> Move scmi_log_stats to locations which mean no extra conditionals
> are needed
> ---
> drivers/firmware/arm_scmi/driver.c | 20 +++++++++++++++-----
> 1 file changed, 15 insertions(+), 5 deletions(-)
>
> diff --git a/drivers/firmware/arm_scmi/driver.c b/drivers/firmware/arm_scmi/driver.c
> index df3eb17cf439..937546397cf2 100644
> --- a/drivers/firmware/arm_scmi/driver.c
> +++ b/drivers/firmware/arm_scmi/driver.c
> @@ -1146,8 +1146,10 @@ static void scmi_handle_response(struct scmi_chan_info *cinfo,
> if (xfer->hdr.type == MSG_TYPE_DELAYED_RESP) {
> scmi_clear_channel(info, cinfo);
> complete(xfer->async_done);
> + scmi_log_stats(&info->stats.dlyd_response_ok);
> } else {
> complete(&xfer->done);
> + scmi_log_stats(&info->stats.response_ok);
> }
>
> if (IS_ENABLED(CONFIG_ARM_SCMI_RAW_MODE_SUPPORT)) {
Up above here in scmi_handle_respnonse() I would take care to track a
few interesting stats about when reponses that went bad...if you look at
scmi_xfer_command_acquire() you will notice that a whole class of errors
there, and spread all around the core, are handled by scmi_bad_message_trace():
please handle all of these errors based on the type of the enum scmi_bad_msg...
....which means, IOW, add a counter for each the enum scmi_bad_msg and handle
them in scmi_bad_message_trace(), so that they will get counted everywhere....
and... :P ... while doing this please consider as usual the fact that we will
like the log_stats call to vanish completely at compile time if STATS=n
...then please look also into handle_notification() and try to add a couple
of counters there too...
... we can discuss all of these new counters in the next review...
> @@ -1231,6 +1233,7 @@ static int scmi_wait_for_reply(struct device *dev, const struct scmi_desc *desc,
> struct scmi_xfer *xfer, unsigned int timeout_ms)
> {
> int ret = 0;
> + struct scmi_info *info = handle_to_scmi_info(cinfo->handle);
>
> if (xfer->hdr.poll_completion) {
> /*
> @@ -1251,13 +1254,12 @@ static int scmi_wait_for_reply(struct device *dev, const struct scmi_desc *desc,
> "timed out in resp(caller: %pS) - polling\n",
> (void *)_RET_IP_);
> ret = -ETIMEDOUT;
> + scmi_log_stats(&info->stats.xfers_response_timeout);
I would rename this as 'xfer_response_polled_timeout' (so adding one more
stats to the structs and debugfs) in order to discern this timeout case
for the one below.
> }
> }
>
> if (!ret) {
> unsigned long flags;
> - struct scmi_info *info =
> - handle_to_scmi_info(cinfo->handle);
>
..in this same code-block, down below right before the trace_scmi_msg, I
would add a:
scmi_log_stats(&info->stats.response_polled_ok);
...if not, you will end-up counting only the successfull reply on the IRQ
path, i.e. only the one that are served by handle_response.
> /*
> * Do not fetch_response if an out-of-order delayed
> @@ -1291,6 +1293,7 @@ static int scmi_wait_for_reply(struct device *dev, const struct scmi_desc *desc,
> dev_err(dev, "timed out in resp(caller: %pS)\n",
> (void *)_RET_IP_);
> ret = -ETIMEDOUT;
> + scmi_log_stats(&info->stats.xfers_response_timeout);
This is a different kind of timeout...on non-polled request, lets keep
it named as it is now...
> }
> }
>
> @@ -1374,13 +1377,15 @@ static int do_xfer(const struct scmi_protocol_handle *ph,
> !is_transport_polling_capable(info->desc)) {
> dev_warn_once(dev,
> "Polling mode is not supported by transport.\n");
> + scmi_log_stats(&info->stats.sent_fail);
Same here ... you have different reason for failures in the following,
so you could try to count them separately...
sent_fail_polling_unsupported
> return -EINVAL;
> }
>
> cinfo = idr_find(&info->tx_idr, pi->proto->id);
> - if (unlikely(!cinfo))
> + if (unlikely(!cinfo)) {
> + scmi_log_stats(&info->stats.sent_fail);
sent_fail_channel_not_found
> return -EINVAL;
> -
> + }
> /* True ONLY if also supported by transport. */
> if (is_polling_enabled(cinfo, info->desc))
> xfer->hdr.poll_completion = true;
> @@ -1412,6 +1417,7 @@ static int do_xfer(const struct scmi_protocol_handle *ph,
> ret = info->desc->ops->send_message(cinfo, xfer);
> if (ret < 0) {
> dev_dbg(dev, "Failed to send message %d\n", ret);
> + scmi_log_stats(&info->stats.sent_fail);
this is the basic, original send failure...it is a generic failure at the transport layer,
so lets keep it named like it is now...
> return ret;
> }
>
...here you can just place your (instead of the one down below)
scmi_log_stats(&info->stats.sent_ok);
...becasue now, right here, you are already sure that the message has been sent ...
> @@ -1420,8 +1426,12 @@ static int do_xfer(const struct scmi_protocol_handle *ph,
> xfer->hdr.status, xfer->tx.buf, xfer->tx.len);
>
> ret = scmi_wait_for_message_response(cinfo, xfer);
> - if (!ret && xfer->hdr.status)
> + if (!ret && xfer->hdr.status) {
> ret = scmi_to_linux_errno(xfer->hdr.status);
> + scmi_log_stats(&info->stats.sent_fail);
...this is NOT a sent_fail...you DID send successfully the message AND received
successfully a response (ret==0) BUT there was an error of some kind at the protocol
layer...i.e. the server sent you an error code in the reply...so I would count here
instead something named like
stats.protocol_errs
> + } else {
> + scmi_log_stats(&info->stats.sent_ok);
and this is really late to be counted as sent, so it would be clearer to
count it above (where I mentiones), so please drop the else.
Thanks,
Cristian
next prev parent reply other threads:[~2024-07-08 14:01 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-07-03 14:37 [PATCH v2 0/4] Add Per-transport SCMI debug statistics Luke Parkin
2024-07-03 14:37 ` [PATCH v2 1/4] firmware: arm_scmi: Remove superfluous handle_to_scmi_info Luke Parkin
2024-07-08 11:08 ` Cristian Marussi
2024-07-03 14:37 ` [PATCH v2 2/4] firmware: arm_scmi: Add support for tracking statistics Luke Parkin
2024-07-08 13:01 ` Cristian Marussi
2024-07-03 14:37 ` [PATCH v2 3/4] firmware: arm_scmi: Track basic SCMI statistics Luke Parkin
2024-07-08 14:00 ` Cristian Marussi [this message]
2024-07-03 14:37 ` [PATCH v2 4/4] firmware: arm_scmi: Create debugfs files for statistics Luke Parkin
2024-07-08 14:07 ` Cristian Marussi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Zovw_4MgumQo8npi@pluto \
--to=cristian.marussi@arm.com \
--cc=arm-scmi@vger.kernel.org \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=luke.parkin@arm.com \
--cc=sudeep.holla@arm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).