public inbox for netdev@vger.kernel.org
 help / color / mirror / Atom feed
From: Paolo Abeni <pabeni@redhat.com>
To: Ratheesh Kannoth <rkannoth@marvell.com>,
	netdev@vger.kernel.org, linux-kernel@vger.kernel.org,
	linux-rdma@vger.kernel.org
Cc: sgoutham@marvell.com, andrew+netdev@lunn.ch, davem@davemloft.net,
	edumazet@google.com, kuba@kernel.org, donald.hunter@gmail.com,
	horms@kernel.org, jiri@resnulli.us, chuck.lever@oracle.com,
	matttbe@kernel.org, cjubran@nvidia.com, saeedm@nvidia.com,
	leon@kernel.org, tariqt@nvidia.com, mbloch@nvidia.com,
	dtatulea@nvidia.com
Subject: Re: [PATCH v10 net-next 3/6] devlink: Implement devlink param multi attribute nested data values
Date: Tue, 7 Apr 2026 11:58:09 +0200	[thread overview]
Message-ID: <c14a0783-a69f-448d-a464-2d802e6d0ec7@redhat.com> (raw)
In-Reply-To: <20260403025533.6250-4-rkannoth@marvell.com>

On 4/3/26 4:55 AM, Ratheesh Kannoth wrote:
> From: Saeed Mahameed <saeedm@nvidia.com>
> 
> Devlink param value attribute is not defined since devlink is handling
> the value validating and parsing internally, this allows us to implement
> multi attribute values without breaking any policies.
> 
> Devlink param multi-attribute values are considered to be dynamically
> sized arrays of u64 values, by introducing a new devlink param type
> DEVLINK_PARAM_TYPE_U64_ARRAY, driver and user space can set a variable
> count of u32 values into the DEVLINK_ATTR_PARAM_VALUE_DATA attribute.
> 
> Implement get/set parsing and add to the internal value structure passed
> to drivers.
> 
> This is useful for devices that need to configure a list of values for
> a specific configuration.
> 
> example:
> $ devlink dev param show pci/... name multi-value-param
> name multi-value-param type driver-specific
> values:
> cmode permanent value: 0,1,2,3,4,5,6,7
> 
> $ devlink dev param set pci/... name multi-value-param \
> 		value 4,5,6,7,0,1,2,3 cmode permanent
> 
> Signed-off-by: Saeed Mahameed <saeedm@nvidia.com>
> Signed-off-by: Ratheesh Kannoth <rkannoth@marvell.com>
> ---
>  Documentation/netlink/specs/devlink.yaml |  4 ++
>  include/net/devlink.h                    |  8 +++
>  include/uapi/linux/devlink.h             |  1 +
>  net/devlink/netlink_gen.c                |  2 +
>  net/devlink/param.c                      | 91 +++++++++++++++++++-----
>  5 files changed, 89 insertions(+), 17 deletions(-)
> 
> diff --git a/Documentation/netlink/specs/devlink.yaml b/Documentation/netlink/specs/devlink.yaml
> index b495d56b9137..b619de4fe08a 100644
> --- a/Documentation/netlink/specs/devlink.yaml
> +++ b/Documentation/netlink/specs/devlink.yaml
> @@ -226,6 +226,10 @@ definitions:
>          value: 10
>        -
>          name: binary
> +      -
> +        name: u64-array
> +        value: 129
> +
>    -
>      name: rate-tc-index-max
>      type: const
> diff --git a/include/net/devlink.h b/include/net/devlink.h
> index 3038af6ec017..3a355fea8189 100644
> --- a/include/net/devlink.h
> +++ b/include/net/devlink.h
> @@ -432,6 +432,13 @@ enum devlink_param_type {
>  	DEVLINK_PARAM_TYPE_U64 = DEVLINK_VAR_ATTR_TYPE_U64,
>  	DEVLINK_PARAM_TYPE_STRING = DEVLINK_VAR_ATTR_TYPE_STRING,
>  	DEVLINK_PARAM_TYPE_BOOL = DEVLINK_VAR_ATTR_TYPE_FLAG,
> +	DEVLINK_PARAM_TYPE_U64_ARRAY = DEVLINK_VAR_ATTR_TYPE_U64_ARRAY,
> +};
> +
> +#define __DEVLINK_PARAM_MAX_ARRAY_SIZE 32
> +struct devlink_param_u64_array {
> +	u64 size;
> +	u64 val[__DEVLINK_PARAM_MAX_ARRAY_SIZE];
>  };
>  
>  union devlink_param_value {
> @@ -441,6 +448,7 @@ union devlink_param_value {
>  	u64 vu64;
>  	char vstr[__DEVLINK_PARAM_MAX_STRING_VALUE];
>  	bool vbool;
> +	struct devlink_param_u64_array u64arr;

Sashiko as a couple of relevant remarks here, specifically:

---
Does this increase the size of union devlink_param_value from 32 bytes
to over 264 bytes?
Looking at existing functions like devlink_nl_param_value_fill_one() and
devlink_nl_param_value_put(), they take multiple copies of this union by
value. Passing two of these unions by value consumes over 528 bytes of
stack space, and combined in a call chain this pushes nearly 800 bytes
of arguments onto the stack.
Could this create a risk of hitting CONFIG_FRAME_WARN limits deep in
driver notification contexts? Should the signatures of the internal
functions and exported APIs be updated to pass the unions by pointer
instead?
---

>  };
>  
>  struct devlink_param_gset_ctx {
> diff --git a/include/uapi/linux/devlink.h b/include/uapi/linux/devlink.h
> index 7de2d8cc862f..5332223dd6d0 100644
> --- a/include/uapi/linux/devlink.h
> +++ b/include/uapi/linux/devlink.h
> @@ -406,6 +406,7 @@ enum devlink_var_attr_type {
>  	DEVLINK_VAR_ATTR_TYPE_BINARY,
>  	__DEVLINK_VAR_ATTR_TYPE_CUSTOM_BASE = 0x80,
>  	/* Any possible custom types, unrelated to NLA_* values go below */
> +	DEVLINK_VAR_ATTR_TYPE_U64_ARRAY,
>  };
>  
>  enum devlink_attr {
> diff --git a/net/devlink/netlink_gen.c b/net/devlink/netlink_gen.c
> index eb35e80e01d1..7aaf462f27ee 100644
> --- a/net/devlink/netlink_gen.c
> +++ b/net/devlink/netlink_gen.c
> @@ -37,6 +37,8 @@ devlink_attr_param_type_validate(const struct nlattr *attr,
>  	case DEVLINK_VAR_ATTR_TYPE_NUL_STRING:
>  		fallthrough;
>  	case DEVLINK_VAR_ATTR_TYPE_BINARY:
> +		fallthrough;
> +	case DEVLINK_VAR_ATTR_TYPE_U64_ARRAY:
>  		return 0;
>  	}
>  	NL_SET_ERR_MSG_ATTR(extack, attr, "invalid enum value");
> diff --git a/net/devlink/param.c b/net/devlink/param.c
> index cf95268da5b0..2ec85dffd8ac 100644
> --- a/net/devlink/param.c
> +++ b/net/devlink/param.c
> @@ -252,6 +252,14 @@ devlink_nl_param_value_put(struct sk_buff *msg, enum devlink_param_type type,
>  				return -EMSGSIZE;
>  		}
>  		break;
> +	case DEVLINK_PARAM_TYPE_U64_ARRAY:
> +		if (val.u64arr.size > __DEVLINK_PARAM_MAX_ARRAY_SIZE)
> +			return -EMSGSIZE;
> +
> +		for (int i = 0; i < val.u64arr.size; i++)
> +			if (nla_put_uint(msg, nla_type, val.u64arr.val[i]))
> +				return -EMSGSIZE;
> +		break;
>  	}
>  	return 0;
>  }
> @@ -304,56 +312,78 @@ static int devlink_nl_param_fill(struct sk_buff *msg, struct devlink *devlink,
>  				 u32 portid, u32 seq, int flags,
>  				 struct netlink_ext_ack *extack)
>  {
> -	union devlink_param_value default_value[DEVLINK_PARAM_CMODE_MAX + 1];
> -	union devlink_param_value param_value[DEVLINK_PARAM_CMODE_MAX + 1];
>  	bool default_value_set[DEVLINK_PARAM_CMODE_MAX + 1] = {};
>  	bool param_value_set[DEVLINK_PARAM_CMODE_MAX + 1] = {};
>  	const struct devlink_param *param = param_item->param;
> -	struct devlink_param_gset_ctx ctx;
> +	union devlink_param_value *default_value;
> +	union devlink_param_value *param_value;
> +	struct devlink_param_gset_ctx *ctx;
>  	struct nlattr *param_values_list;
>  	struct nlattr *param_attr;
>  	void *hdr;
>  	int err;
>  	int i;
>  
> +	default_value = kcalloc(DEVLINK_PARAM_CMODE_MAX + 1,
> +				sizeof(*default_value), GFP_KERNEL);
> +	if (!default_value)
> +		return -ENOMEM;
> +
> +	param_value = kcalloc(DEVLINK_PARAM_CMODE_MAX + 1,
> +			      sizeof(*param_value), GFP_KERNEL);
> +	if (!param_value) {
> +		kfree(default_value);
> +		return -ENOMEM;
> +	}
> +
> +	ctx = kmalloc_obj(*ctx);
> +	if (!ctx) {
> +		kfree(param_value);
> +		kfree(default_value);
> +		return -ENOMEM;
> +	}
> +
>  	/* Get value from driver part to driverinit configuration mode */
>  	for (i = 0; i <= DEVLINK_PARAM_CMODE_MAX; i++) {
>  		if (!devlink_param_cmode_is_supported(param, i))
>  			continue;
>  		if (i == DEVLINK_PARAM_CMODE_DRIVERINIT) {
> -			if (param_item->driverinit_value_new_valid)
> +			if (param_item->driverinit_value_new_valid) {
>  				param_value[i] = param_item->driverinit_value_new;
> -			else if (param_item->driverinit_value_valid)
> +			} else if (param_item->driverinit_value_valid) {
>  				param_value[i] = param_item->driverinit_value;
> -			else
> -				return -EOPNOTSUPP;
> +			} else {
> +				err = -EOPNOTSUPP;
> +				goto get_put_fail;
> +			}
>  
>  			if (param_item->driverinit_value_valid) {
>  				default_value[i] = param_item->driverinit_default;
>  				default_value_set[i] = true;
>  			}
>  		} else {
> -			ctx.cmode = i;
> -			err = devlink_param_get(devlink, param, &ctx, extack);
> +			ctx->cmode = i;
> +			err = devlink_param_get(devlink, param, ctx, extack);
>  			if (err)
> -				return err;
> -			param_value[i] = ctx.val;
> +				goto get_put_fail;
> +			param_value[i] = ctx->val;
>  
> -			err = devlink_param_get_default(devlink, param, &ctx,
> +			err = devlink_param_get_default(devlink, param, ctx,
>  							extack);
>  			if (!err) {
> -				default_value[i] = ctx.val;
> +				default_value[i] = ctx->val;
>  				default_value_set[i] = true;
>  			} else if (err != -EOPNOTSUPP) {
> -				return err;
> +				goto get_put_fail;
>  			}
>  		}
>  		param_value_set[i] = true;
>  	}
>  
> +	err = -EMSGSIZE;
>  	hdr = genlmsg_put(msg, portid, seq, &devlink_nl_family, flags, cmd);
>  	if (!hdr)
> -		return -EMSGSIZE;
> +		goto get_put_fail;
>  
>  	if (devlink_nl_put_handle(msg, devlink))
>  		goto genlmsg_cancel;
> @@ -393,6 +423,9 @@ static int devlink_nl_param_fill(struct sk_buff *msg, struct devlink *devlink,
>  	nla_nest_end(msg, param_values_list);
>  	nla_nest_end(msg, param_attr);
>  	genlmsg_end(msg, hdr);
> +	kfree(default_value);
> +	kfree(param_value);
> +	kfree(ctx);
>  	return 0;
>  
>  values_list_nest_cancel:
> @@ -401,7 +434,11 @@ static int devlink_nl_param_fill(struct sk_buff *msg, struct devlink *devlink,
>  	nla_nest_cancel(msg, param_attr);
>  genlmsg_cancel:
>  	genlmsg_cancel(msg, hdr);
> -	return -EMSGSIZE;
> +get_put_fail:
> +	kfree(default_value);
> +	kfree(param_value);
> +	kfree(ctx);
> +	return err;
>  }
>  
>  static void devlink_param_notify(struct devlink *devlink,
> @@ -507,7 +544,7 @@ devlink_param_value_get_from_info(const struct devlink_param *param,
>  				  union devlink_param_value *value)
>  {
>  	struct nlattr *param_data;
> -	int len;
> +	int len, cnt, rem;
>  
>  	param_data = info->attrs[DEVLINK_ATTR_PARAM_VALUE_DATA];
>  
> @@ -547,6 +584,26 @@ devlink_param_value_get_from_info(const struct devlink_param *param,
>  			return -EINVAL;
>  		value->vbool = nla_get_flag(param_data);
>  		break;
> +
> +	case DEVLINK_PARAM_TYPE_U64_ARRAY:
> +		cnt = 0;
> +		nla_for_each_attr_type(param_data,
> +				       DEVLINK_ATTR_PARAM_VALUE_DATA,
> +				       genlmsg_data(info->genlhdr),
> +				       genlmsg_len(info->genlhdr), rem) {
> +			if (cnt >= __DEVLINK_PARAM_MAX_ARRAY_SIZE)
> +				return -EMSGSIZE;
> +
> +			if ((nla_len(param_data) != sizeof(u64)) &&
> +			    (nla_len(param_data) != sizeof(u32)))
> +				return -EINVAL;
> +
> +			value->u64arr.val[cnt] = (u64)nla_get_uint(param_data);
> +			cnt++;
> +		}
> +
> +		value->u64arr.size = cnt;
> +		break;

Sashiko says:

---
Does this make it impossible to set an empty array to clear a
multi-value parameter?
If userspace provides 0 elements, param_data will be NULL. Earlier in
devlink_param_value_get_from_info(), there is a check:
	param_data = info->attrs[DEVLINK_ATTR_PARAM_VALUE_DATA];
	if (param->type != DEVLINK_PARAM_TYPE_BOOL && !param_data)
		return -EINVAL;
If the parameter is a U64_ARRAY and no data is provided, this check will
immediately return -EINVAL.
The kernel can successfully emit an empty array on a GET request if the
size is 0. Should the SET path similarly support receiving 0 elements to
allow userspace to clear a multi-value parameter?
---

There are several others NIC-specific remarks, which IMHO are mostly
pre-existing issues, but please have a look:

https://sashiko.dev/#/patchset/20260403025533.6250-1-rkannoth%40marvell.com

/P


  reply	other threads:[~2026-04-07  9:58 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2026-04-03  2:55 [PATCH v10 net-next 0/6] octeontx2-af: npc: Enhancements Ratheesh Kannoth
2026-04-03  2:55 ` [PATCH v10 net-next 1/6] octeontx2-af: npc: cn20k: debugfs enhancements Ratheesh Kannoth
2026-04-03  2:55 ` [PATCH v10 net-next 2/6] net/mlx5e: heap-allocate devlink param values Ratheesh Kannoth
2026-04-03  2:55 ` [PATCH v10 net-next 3/6] devlink: Implement devlink param multi attribute nested data values Ratheesh Kannoth
2026-04-07  9:58   ` Paolo Abeni [this message]
2026-04-03  2:55 ` [PATCH v10 net-next 4/6] octeontx2-af: npc: cn20k: add subbank search order control Ratheesh Kannoth
2026-04-03  2:55 ` [PATCH v10 net-next 5/6] octeontx2-af: npc: cn20k: dynamically allocate and free default MCAM entries Ratheesh Kannoth
2026-04-03  2:55 ` [PATCH v10 net-next 6/6] octeontx2-af: npc: Support for custom KPU profile from filesystem Ratheesh Kannoth

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=c14a0783-a69f-448d-a464-2d802e6d0ec7@redhat.com \
    --to=pabeni@redhat.com \
    --cc=andrew+netdev@lunn.ch \
    --cc=chuck.lever@oracle.com \
    --cc=cjubran@nvidia.com \
    --cc=davem@davemloft.net \
    --cc=donald.hunter@gmail.com \
    --cc=dtatulea@nvidia.com \
    --cc=edumazet@google.com \
    --cc=horms@kernel.org \
    --cc=jiri@resnulli.us \
    --cc=kuba@kernel.org \
    --cc=leon@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-rdma@vger.kernel.org \
    --cc=matttbe@kernel.org \
    --cc=mbloch@nvidia.com \
    --cc=netdev@vger.kernel.org \
    --cc=rkannoth@marvell.com \
    --cc=saeedm@nvidia.com \
    --cc=sgoutham@marvell.com \
    --cc=tariqt@nvidia.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox