From: Rahul Rameshbabu <rrameshbabu@nvidia.com>
To: "Arinzon, David" <darinzon@amazon.com>
Cc: Gal Pressman <gal@nvidia.com>, Jakub Kicinski <kuba@kernel.org>,
David Miller <davem@davemloft.net>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
Eric Dumazet <edumazet@google.com>,
Paolo Abeni <pabeni@redhat.com>,
Richard Cochran <richardcochran@gmail.com>,
"Woodhouse, David" <dwmw@amazon.co.uk>,
"Machulsky, Zorik" <zorik@amazon.com>,
"Matushevsky, Alexander" <matua@amazon.com>,
"Bshara, Saeed" <saeedb@amazon.com>,
"Wilson, Matt" <msw@amazon.com>,
"Liguori, Anthony" <aliguori@amazon.com>,
"Bshara, Nafea" <nafea@amazon.com>,
"Schmeilin, Evgeny" <evgenys@amazon.com>,
"Belgazal, Netanel" <netanel@amazon.com>,
"Saidi, Ali" <alisaidi@amazon.com>,
"Herrenschmidt, Benjamin" <benh@amazon.com>,
"Kiyanovski, Arthur" <akiyano@amazon.com>,
"Dagan, Noam" <ndagan@amazon.com>,
"Bernstein, Amit" <amitbern@amazon.com>,
"Agroskin, Shay" <shayagr@amazon.com>,
"Abboud, Osama" <osamaabb@amazon.com>,
"Ostrovsky, Evgeny" <evostrov@amazon.com>,
"Tabachnik, Ofir" <ofirt@amazon.com>,
"Machnikowski, Maciek" <maciek@machnikowski.net>
Subject: Re: [PATCH v3 net-next 3/3] net: ena: Add PHC documentation
Date: Tue, 05 Nov 2024 18:02:57 -0800 [thread overview]
Message-ID: <875xp1azla.fsf@nvidia.com> (raw)
In-Reply-To: <4ba02d13a8c14045b0df7deaee188f82@amazon.com> (David Arinzon's message of "Tue, 5 Nov 2024 16:52:11 +0000")
On Tue, 05 Nov, 2024 16:52:11 +0000 "Arinzon, David" <darinzon@amazon.com> wrote:
>> >>> +=================
>> >> ======================================================
>> >>> +**phc_cnt** Number of successful retrieved timestamps (below
>> >> expire timeout).
>> >>> +**phc_exp** Number of expired retrieved timestamps (above
>> >> expire timeout).
>> >>> +**phc_skp** Number of skipped get time attempts (during block
>> >> period).
>> >>> +**phc_err** Number of failed get time attempts (entering into
>> block
>> >> state).
>> >>> +=================
>> >> ======================================================
>> >>
>> >> I seem to recall we had an unpleasant conversation about using
>> >> standard stats recently. Please tell me where you looked to check if
>> >> Linux has standard stats for packet timestamping. We need to add the
>> right info there.
>> >> --
>> >> pw-bot: cr
>> >
>> > Hi Jakub,
>> >
>> > Just wanted to clarify that this feature and the associated
>> > documentation are specifically intended for reading a HW timestamp, not
>> for TX/RX packet timestamping.
>> > We reviewed similar drivers that support HW timestamping via
>> > `gettime64` and `gettimex64` APIs, and we couldn't identify any that
>> capture or report statistics related to reading a HW timestamp.
>> > Let us know if further details would be helpful.
>>
>> David, did you consider Rahul's recent timestamping stats API?
>> 0e9c127729be ("ethtool: add interface to read Tx hardware timestamping
>> statistics")
>
> Hi Gal,
>
> We've looked into the `get_ts_stats` ethtool hook, and it refers to TX HW packet timestamping
> and not HW timestamp which is retrieved through `gettime64` and `gettimex64`.
Hi folks,
I think everyone might be on the same page now, but I wanted to provide
some clarifications just in case.
The use case that the TX HW timestamping statistics covers
┌─────────────────────────────────────┐
│ │
│ │
│ ┌──────┤
│ NIC hw │ │ Packets out
│ │ PF ├────────────────────────────────────────────────►
│ │ │
│ └───┬──┤
│ │ │
│ │ │
│ │ │
│ ┌────┘ │
│ │ │
└─────────────────────────────┼───────┘
│Hw timestamp information per packet
┌─────────────────────────────┼───────┐
│ │ │
│ ┌─────▼────┐ │
│ │ cmsg │ │
│ │ │ │
│ │ queue │ │
│ │ │ │
│ └──────────┘ │
│ │
│ │
│ Linux Kernel Stack │
│ │
│ │
│ │
└─────────────────────────────────────┘
We are collecting statistics on every packet being sent out the wire.
The use case being described here
┌──────────────────────────────────┐
│ │
│ │
│ ┌───────┤
│ NIC hw │ │
│ │ PF │
│ ┌───────────────────┐ │ │
│ │ │ └───────┤
│ │ PHC │ │
│ │ │ │
│ │ │ │
│ │ (just a clock dev)│ │
│ └─────────┬─────────┘ │
│ │ │
└──────────────┼───────────────────┘
│ Query device's clock for current time
┌──────────────┼───────────────────┐ ┌─────────────────────────────────┐
│ │ │ │ │
│ │ │ │ │
│ ┌──────────┼─────────────┐ │ │ │
│ │ │ │ │ │ ┌───────────────────────┐ │
│ │ │ │ │ │ │ │ │
│ │ ▼ ───┼─────┼──────┼────┼──────────► │ │
│ │ │ │ │ │ │ │
│ │ .gettimex64 callback │ │ │ │ clock_gettime syscall│ │
│ └────────────────────────┘ │ │ └───────────────────────┘ │
│ │ │ │
│ Linux Kernel Stack │ │ Userspace │
│ │ │ │
└──────────────────────────────────┘ └─────────────────────────────────┘
The model above is about getting the time from the NIC hardware in the
userspace application (which has not involvement with TX/RX traffic).
I do think the phc-to-host related statistics are on the niche side of
things. The following drivers are error free in their gettimex64 paths.
* AMD pesando/ionic
* Broadcom Tigon3
* Intel ixgbe
* Intel igc
* NVIDIA mlxsw
* NVIDIA mlx5_core
The above drivers would definitely not benefit from having "phc
(nic)"-to-host related statistics being presented here. I am more in
favor of making these statistics specific to amazon's ENA driver since I
think most drivers do not have a complex . Also, what value is there in
the count of phc-to-host successful/failed operations versus just
keeping track of the errors in userspace for whoever is calling
clock_gettime. I am somewhat ok with these counters, but I honestly
cannot imagine any practical use to this especially since they are not
related to anything over-the-wire. So the errors in userspace would be
enough of an indicator of whether there is excessive utilization of the
requests and the counters seem redundant to that (at least to me). Feel
free to share how you feel these counters would be helpful beyond
handling the return codes through clock_gettime. I might just be missing
something.
Hope this helps.
--
Thanks,
Rahul Rameshbabu
next prev parent reply other threads:[~2024-11-06 2:03 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-11-03 11:31 [PATCH v3 net-next 0/3] PHC support in ENA driver David Arinzon
2024-11-03 11:31 ` [PATCH v3 net-next 1/3] net: ena: Add PHC support in the " David Arinzon
2024-11-03 11:31 ` [PATCH v3 net-next 2/3] net: ena: PHC silent reset David Arinzon
2024-11-03 11:31 ` [PATCH v3 net-next 3/3] net: ena: Add PHC documentation David Arinzon
2024-11-05 2:17 ` Jakub Kicinski
2024-11-05 10:52 ` Arinzon, David
2024-11-05 16:16 ` Gal Pressman
2024-11-05 16:52 ` Arinzon, David
2024-11-06 2:02 ` Rahul Rameshbabu [this message]
2024-11-06 2:15 ` Jakub Kicinski
2024-11-06 1:28 ` Jakub Kicinski
2024-11-12 17:53 ` Arinzon, David
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=875xp1azla.fsf@nvidia.com \
--to=rrameshbabu@nvidia.com \
--cc=akiyano@amazon.com \
--cc=aliguori@amazon.com \
--cc=alisaidi@amazon.com \
--cc=amitbern@amazon.com \
--cc=benh@amazon.com \
--cc=darinzon@amazon.com \
--cc=davem@davemloft.net \
--cc=dwmw@amazon.co.uk \
--cc=edumazet@google.com \
--cc=evgenys@amazon.com \
--cc=evostrov@amazon.com \
--cc=gal@nvidia.com \
--cc=kuba@kernel.org \
--cc=maciek@machnikowski.net \
--cc=matua@amazon.com \
--cc=msw@amazon.com \
--cc=nafea@amazon.com \
--cc=ndagan@amazon.com \
--cc=netanel@amazon.com \
--cc=netdev@vger.kernel.org \
--cc=ofirt@amazon.com \
--cc=osamaabb@amazon.com \
--cc=pabeni@redhat.com \
--cc=richardcochran@gmail.com \
--cc=saeedb@amazon.com \
--cc=shayagr@amazon.com \
--cc=zorik@amazon.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).