From: Zhu Yanjun <yanjun.zhu@linux.dev>
To: Mark Bloch <mbloch@nvidia.com>,
"David S. Miller" <davem@davemloft.net>,
Jakub Kicinski <kuba@kernel.org>, Paolo Abeni <pabeni@redhat.com>,
Eric Dumazet <edumazet@google.com>,
Andrew Lunn <andrew+netdev@lunn.ch>,
Simon Horman <horms@kernel.org>
Cc: saeedm@nvidia.com, gal@nvidia.com, leonro@nvidia.com,
tariqt@nvidia.com, Leon Romanovsky <leon@kernel.org>,
Jesper Dangaard Brouer <hawk@kernel.org>,
Ilias Apalodimas <ilias.apalodimas@linaro.org>,
Richard Cochran <richardcochran@gmail.com>,
Alexei Starovoitov <ast@kernel.org>,
Daniel Borkmann <daniel@iogearbox.net>,
John Fastabend <john.fastabend@gmail.com>,
Stanislav Fomichev <sdf@fomichev.me>,
netdev@vger.kernel.org, linux-rdma@vger.kernel.org,
linux-kernel@vger.kernel.org, bpf@vger.kernel.org,
Dragos Tatulea <dtatulea@nvidia.com>
Subject: Re: [PATCH net-next v6 10/12] net/mlx5e: Implement queue mgmt ops and single channel swap
Date: Tue, 17 Jun 2025 23:06:52 -0700
Message-ID: <325ab9a0-44d1-44a2-aefe-9cd49dcd12f5@linux.dev>
In-Reply-To: <20250616141441.1243044-11-mbloch@nvidia.com>
On 2025/6/16 7:14, Mark Bloch wrote:
> From: Saeed Mahameed <saeedm@nvidia.com>
>
> The bulk of the work is done in mlx5e_queue_mem_alloc, where we allocate
> and create the new channel resources, similar to
> mlx5e_safe_switch_params, but here we do it for a single channel using
> existing params, sort of a clone channel.
> To swap the old channel with the new one, we deactivate and close the
> old channel then replace it with the new one, since the swap procedure
> doesn't fail in mlx5, we do it all in one place (mlx5e_queue_start).
>
> Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
> Reviewed-by: Dragos Tatulea <dtatulea@nvidia.com>
> Reviewed-by: Tariq Toukan <tariqt@nvidia.com>
> Signed-off-by: Mark Bloch <mbloch@nvidia.com>
> ---
> .../net/ethernet/mellanox/mlx5/core/en_main.c | 98 +++++++++++++++++++
> 1 file changed, 98 insertions(+)
>
> diff --git a/drivers/net/ethernet/mellanox/mlx5/core/en_main.c b/drivers/net/ethernet/mellanox/mlx5/core/en_main.c
> index a51e204bd364..873a42b4a82d 100644
> --- a/drivers/net/ethernet/mellanox/mlx5/core/en_main.c
> +++ b/drivers/net/ethernet/mellanox/mlx5/core/en_main.c
> @@ -5494,6 +5494,103 @@ static const struct netdev_stat_ops mlx5e_stat_ops = {
> .get_base_stats = mlx5e_get_base_stats,
> };
>
> +struct mlx5_qmgmt_data {
> + struct mlx5e_channel *c;
> + struct mlx5e_channel_param cparam;
> +};
> +
> +static int mlx5e_queue_mem_alloc(struct net_device *dev, void *newq,
> + int queue_index)
> +{
> + struct mlx5_qmgmt_data *new = (struct mlx5_qmgmt_data *)newq;
> + struct mlx5e_priv *priv = netdev_priv(dev);
> + struct mlx5e_channels *chs = &priv->channels;
> + struct mlx5e_params params = chs->params;
Should these local declarations follow RCT (Reverse Christmas Tree)
ordering? The chs line is longer than the priv line above it.
Yanjun.Zhu
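
For reference, here is a minimal sketch of what RCT ordering could look
like when one local depends on another. The demo_* types and
demo_queue_check() are hypothetical stand-ins, not the real mlx5e
structures or the function in this patch, and the mtu check exists only
so that params is used. The locals are declared longest line first, and
the dependent initializations move into the function body:

```c
/* Hypothetical stand-ins for the mlx5e types; NOT the real driver structs. */
struct demo_params {
	int mtu;
};

struct demo_channels {
	struct demo_params params;
	int num;
};

struct demo_priv {
	struct demo_channels channels;
};

/*
 * Locals are declared longest line first (reverse Christmas tree).
 * params depends on chs, and chs depends on priv, so the dependent
 * assignments are done in the body rather than in the declarations.
 */
static int demo_queue_check(struct demo_priv *priv, int queue_index)
{
	struct demo_channels *chs;
	struct demo_params params;
	int err = 0;

	chs = &priv->channels;
	params = chs->params;

	if (queue_index >= chs->num)
		err = -1; /* placeholder for -ERANGE in the patch */
	else if (params.mtu <= 0)
		err = -2; /* hypothetical check, only so params is used */

	return err;
}
```

Whether splitting declaration and initialization is worth it here is a
judgment call; keeping the initializers and accepting the dependency
order may be equally acceptable.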
> + struct mlx5_core_dev *mdev;
> + int err;
> +
> + mutex_lock(&priv->state_lock);
> + if (!test_bit(MLX5E_STATE_OPENED, &priv->state)) {
> + err = -ENODEV;
> + goto unlock;
> + }
> +
> + if (queue_index >= chs->num) {
> + err = -ERANGE;
> + goto unlock;
> + }
> +
> + if (MLX5E_GET_PFLAG(&chs->params, MLX5E_PFLAG_TX_PORT_TS) ||
> + chs->params.ptp_rx ||
> + chs->params.xdp_prog ||
> + priv->htb) {
> + netdev_err(priv->netdev,
> + "Cloning channels with Port/rx PTP, XDP or HTB is not supported\n");
> + err = -EOPNOTSUPP;
> + goto unlock;
> + }
> +
> + mdev = mlx5_sd_ch_ix_get_dev(priv->mdev, queue_index);
> + err = mlx5e_build_channel_param(mdev, &params, &new->cparam);
> + if (err)
> + goto unlock;
> +
> + err = mlx5e_open_channel(priv, queue_index, &params, NULL, &new->c);
> +unlock:
> + mutex_unlock(&priv->state_lock);
> + return err;
> +}
> +
> +static void mlx5e_queue_mem_free(struct net_device *dev, void *mem)
> +{
> + struct mlx5_qmgmt_data *data = (struct mlx5_qmgmt_data *)mem;
> +
> + /* not supposed to happen since mlx5e_queue_start never fails
> + * but this is how this should be implemented just in case
> + */
> + if (data->c)
> + mlx5e_close_channel(data->c);
> +}
> +
> +static int mlx5e_queue_stop(struct net_device *dev, void *oldq, int queue_index)
> +{
> + /* In mlx5 a txq cannot be simply stopped in isolation, only restarted.
> + * mlx5e_queue_start does not fail, we stop the old queue there.
> + * TODO: Improve this.
> + */
> + return 0;
> +}
> +
> +static int mlx5e_queue_start(struct net_device *dev, void *newq,
> + int queue_index)
> +{
> + struct mlx5_qmgmt_data *new = (struct mlx5_qmgmt_data *)newq;
> + struct mlx5e_priv *priv = netdev_priv(dev);
> + struct mlx5e_channel *old;
> +
> + mutex_lock(&priv->state_lock);
> +
> + /* stop and close the old */
> + old = priv->channels.c[queue_index];
> + mlx5e_deactivate_priv_channels(priv);
> + /* close old before activating new, to avoid napi conflict */
> + mlx5e_close_channel(old);
> +
> + /* start the new */
> + priv->channels.c[queue_index] = new->c;
> + mlx5e_activate_priv_channels(priv);
> + mutex_unlock(&priv->state_lock);
> + return 0;
> +}
> +
> +static const struct netdev_queue_mgmt_ops mlx5e_queue_mgmt_ops = {
> + .ndo_queue_mem_size = sizeof(struct mlx5_qmgmt_data),
> + .ndo_queue_mem_alloc = mlx5e_queue_mem_alloc,
> + .ndo_queue_mem_free = mlx5e_queue_mem_free,
> + .ndo_queue_start = mlx5e_queue_start,
> + .ndo_queue_stop = mlx5e_queue_stop,
> +};
> +
> static void mlx5e_build_nic_netdev(struct net_device *netdev)
> {
> struct mlx5e_priv *priv = netdev_priv(netdev);
> @@ -5504,6 +5601,7 @@ static void mlx5e_build_nic_netdev(struct net_device *netdev)
> SET_NETDEV_DEV(netdev, mdev->device);
>
> netdev->netdev_ops = &mlx5e_netdev_ops;
> + netdev->queue_mgmt_ops = &mlx5e_queue_mgmt_ops;
> netdev->xdp_metadata_ops = &mlx5e_xdp_metadata_ops;
> netdev->xsk_tx_metadata_ops = &mlx5e_xsk_tx_metadata_ops;
> netdev->request_ops_lock = true;