From: Jason Wang <jasowang@redhat.com>
To: Si-Wei Liu <si-wei.liu@oracle.com>
Cc: virtualization@lists.linux-foundation.org,
linux-kernel@vger.kernel.org, Cindy Lu <lulu@redhat.com>,
mst@redhat.com
Subject: Re: [PATCH v3 3/4] vdpa: show dev config as-is in "vdpa dev show" output
Date: Wed, 26 Oct 2022 12:44:44 +0800
Message-ID: <38b9648b-8795-5b41-b06c-d13a1a6d5d3f@redhat.com>
In-Reply-To: <25a18d7a-ebae-3647-32f0-14ad733118be@oracle.com>
On 2022/10/26 09:10, Si-Wei Liu wrote:
>
>
> On 10/24/2022 7:24 PM, Jason Wang wrote:
>> On Tue, Oct 25, 2022 at 3:14 AM Si-Wei Liu<si-wei.liu@oracle.com> wrote:
>>> On 10/24/2022 1:40 AM, Jason Wang wrote:
>>>> On Sat, Oct 22, 2022 at 7:49 AM Si-Wei Liu<si-wei.liu@oracle.com> wrote:
>>>>> Live migration of vdpa would typically require re-instating the vdpa
>>>>> device with an identical set of configs on the destination node,
>>>>> same way as how source node created the device in the first
>>>>> place. In order to save orchestration software from memorizing
>>>>> and keeping track of vdpa config, it will be helpful if the vdpa
>>>>> tool provides the aids for exporting the initial configs as-is,
>>>>> the way how vdpa device was created. The "vdpa dev show" command
>>>>> seems to be the right vehicle for that. It is unlike the "vdpa dev
>>>>> config show" command output which usually goes with the live value
>>>>> in the device config space, and is not quite reliable subject to
>>>>> the dynamics of feature negotiation or possible change by the
>>>>> driver to the config space.
>>>>>
>>>>> Examples:
>>>>>
>>>>> 1) Create vDPA by default without any config attribute
>>>>>
>>>>> $ vdpa dev add mgmtdev pci/0000:41:04.2 name vdpa0
>>>>> $ vdpa dev show vdpa0
>>>>> vdpa0: type network mgmtdev pci/0000:41:04.2 vendor_id 5555 max_vqs 9 max_vq_size 256
>>>>> $ vdpa dev -jp show vdpa0
>>>>> {
>>>>> "dev": {
>>>>> "vdpa0": {
>>>>> "type": "network",
>>>>> "mgmtdev": "pci/0000:41:04.2",
>>>>> "vendor_id": 5555,
>>>>> "max_vqs": 9,
>>>>> "max_vq_size": 256
>>>>> }
>>>>> }
>>>>> }
>>>>>
>>>>> 2) Create vDPA with config attribute(s) specified
>>>>>
>>>>> $ vdpa dev add mgmtdev pci/0000:41:04.2 name vdpa0 \
>>>>> mac e4:11:c6:d3:45:f0 max_vq_pairs 4
>>>>> $ vdpa dev show
>>>>> vdpa0: type network mgmtdev pci/0000:41:04.2 vendor_id 5555 max_vqs 9 max_vq_size 256
>>>>> initial_config: mac e4:11:c6:d3:45:f0 max_vq_pairs 4
>>>>> $ vdpa dev -jp show
>>>>> {
>>>>> "dev": {
>>>>> "vdpa0": {
>>>>> "type": "network",
>>>>> "mgmtdev": "pci/0000:41:04.2",
>>>>> "vendor_id": 5555,
>>>>> "max_vqs": 9,
>>>>> "max_vq_size": 256,
>>>>> "initial_config": {
>>>>> "mac": "e4:11:c6:d3:45:f0",
>>>>> "max_vq_pairs": 4
>>>>> }
>>>>> }
>>>>> }
>>>>> }
>>>>>
>>>>> Signed-off-by: Si-Wei Liu<si-wei.liu@oracle.com>
>>>>> ---
>>>>> drivers/vdpa/vdpa.c | 39 +++++++++++++++++++++++++++++++++++++++
>>>>> 1 file changed, 39 insertions(+)
>>>>>
>>>>> diff --git a/drivers/vdpa/vdpa.c b/drivers/vdpa/vdpa.c
>>>>> index bebded6..bfb8f54 100644
>>>>> --- a/drivers/vdpa/vdpa.c
>>>>> +++ b/drivers/vdpa/vdpa.c
>>>>> @@ -677,6 +677,41 @@ static int vdpa_nl_cmd_dev_del_set_doit(struct sk_buff *skb, struct genl_info *i
>>>>> }
>>>>>
>>>>> static int
>>>>> +vdpa_dev_initcfg_fill(struct vdpa_device *vdev, struct sk_buff *msg, u32 device_id)
>>>>> +{
>>>>> + struct vdpa_dev_set_config *cfg = &vdev->init_cfg;
>>>>> + int err = -EMSGSIZE;
>>>>> +
>>>>> + if (!cfg->mask)
>>>>> + return 0;
>>>>> +
>>>>> + switch (device_id) {
>>>>> + case VIRTIO_ID_NET:
>>>>> + if ((cfg->mask & BIT_ULL(VDPA_ATTR_DEV_NET_CFG_MACADDR)) != 0 &&
>>>>> + nla_put(msg, VDPA_ATTR_DEV_NET_CFG_MACADDR,
>>>>> + sizeof(cfg->net.mac), cfg->net.mac))
>>>>> + return err;
>>>>> + if ((cfg->mask & BIT_ULL(VDPA_ATTR_DEV_NET_CFG_MTU)) != 0 &&
>>>>> + nla_put_u16(msg, VDPA_ATTR_DEV_NET_CFG_MTU, cfg->net.mtu))
>>>>> + return err;
>>>>> + if ((cfg->mask & BIT_ULL(VDPA_ATTR_DEV_NET_CFG_MAX_VQP)) != 0 &&
>>>>> + nla_put_u16(msg, VDPA_ATTR_DEV_NET_CFG_MAX_VQP,
>>>>> + cfg->net.max_vq_pairs))
>>>>> + return err;
>>>>> + break;
>>>>> + default:
>>>>> + break;
>>>>> + }
>>>>> +
>>>>> + if ((cfg->mask & BIT_ULL(VDPA_ATTR_DEV_FEATURES)) != 0 &&
>>>>> + nla_put_u64_64bit(msg, VDPA_ATTR_DEV_FEATURES,
>>>>> + cfg->device_features, VDPA_ATTR_PAD))
>>>>> + return err;
>>>> A question: If any of those above attributes were not provisioned,
>>>> should we show the ones that are inherited from the parent?
>>> A simple answer would be yes, but the long answer is that I am not sure
>>> if there's any for the moment - there's no default value for mtu, mac,
>>> and max_vqp that can be inherited from the parent (max_vqp by default
>>> being 1 is spec defined, not something inherited from the parent).
>> Note that this is the default at the driver level if _F_MQ is not
>> negotiated. But I think we are talking about something different that
>> is out of the spec here, what if:
>>
>> vDPA inherits _F_MQ but max_vqp is not provisioned via netlink.
>>
>> Or is it not allowed?
> My understanding is that this is not allowed any more since the
> introduction of max_vqp attribute. Noted, currently we don't have a
> way for vendor driver to report the default value for max_vqp,
I think it can be reported in this patch?
> if not otherwise specified in the CLI. Without getting the default
> value reported in 'vdpa mgmtdev show' level, it'd just confuse mgmt
> software even more.
Yes, this is something that we need to fix. What's more, in order to
support dynamic provisioning, we need a way to report the number of
available instances that could be used for vDPA device provisioning.
>
>> At least some time in the past, mlx5 was
>> enabled with MQ with 8 queue pairs by default.
> That was the situation when there's no max_vqp attribute support from
> vdpa netlink API level. I think now every driver honors the vdpa core
> disposition to get a single queue pair if max_vqp config is missing.
So we have:
int vdpa_register_device(struct vdpa_device *vdev, int nvqs)
This means that, technically, the parent can allocate a multiqueue device
with the _F_MQ feature if max_vqp and device_features are not provisioned.
What's more, what happens if _F_MQ is provisioned but max_vqp is not
specified?
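That ambiguous combination is at least easy to detect. Below is a rough user-space sketch, not kernel code: struct prov_cfg, the PROV_* mask bits and mq_without_max_vqp() are hypothetical names made up for illustration, and only the VIRTIO_NET_F_MQ bit number comes from the virtio spec. It only flags the case; how the core or the parent should react to it is left open.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define VIRTIO_NET_F_MQ 22 /* feature bit number from the virtio spec */

/* Hypothetical mask bits marking which attributes were provisioned. */
#define PROV_MAX_VQP  (1u << 0)
#define PROV_FEATURES (1u << 1)

/* Illustrative stand-in for the provisioned config, not the kernel struct. */
struct prov_cfg {
	uint32_t mask;
	uint16_t max_vq_pairs;
	uint64_t device_features;
};

/* True when _F_MQ was explicitly provisioned but max_vqp was left out. */
static bool mq_without_max_vqp(const struct prov_cfg *c)
{
	bool mq;

	assert(c);
	mq = (c->mask & PROV_FEATURES) &&
	     (c->device_features & (1ULL << VIRTIO_NET_F_MQ));

	return mq && !(c->mask & PROV_MAX_VQP);
}
```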
The question is:
When an attribute is not specified/provisioned via netlink, what's the
default value? The answer should be consistent: if device_features is
determined by the parent, we should do the same for max_vqp. And it
looks to me that all of those belong to the initial config (self-contained).
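A minimal user-space sketch of that rule, not kernel code: struct cfg, the CFG_* mask bits and resolve_initial_cfg() are all made up for illustration. It assumes the parent can report a default for every attribute, which is not the case today.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical attribute bits; the kernel uses VDPA_ATTR_* indices instead. */
enum {
	CFG_MAX_VQP  = 1u << 0,
	CFG_FEATURES = 1u << 1,
};

/* Illustrative stand-in for a config set: mask says which fields are valid. */
struct cfg {
	uint32_t mask;
	uint16_t max_vq_pairs;
	uint64_t device_features;
};

/*
 * Resolve the effective initial config: take each attribute from the
 * provisioned set when its mask bit is set, otherwise fall back to the
 * parent's default. The result has every bit set, i.e. it is self-contained.
 */
static struct cfg resolve_initial_cfg(const struct cfg *prov,
				      const struct cfg *parent)
{
	struct cfg out = { .mask = CFG_MAX_VQP | CFG_FEATURES };

	/* Assumption: the parent reports a default for every attribute. */
	assert((parent->mask & out.mask) == out.mask);

	out.max_vq_pairs = (prov->mask & CFG_MAX_VQP) ?
		prov->max_vq_pairs : parent->max_vq_pairs;
	out.device_features = (prov->mask & CFG_FEATURES) ?
		prov->device_features : parent->device_features;
	return out;
}
```

With something like this, mgmt software would always see one complete set of values, whether each one came from the command line or from the parent.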
> And the mlx5_vdpa driver with 8 queue pairs in the wild days is just
> irrelevant to be manageable by mgmt software, regardless of live
> migration.
>>> And
>>> the device_features if inherited is displayed at 'vdpa dev config show'
>>> output. Can you remind me of a good example for inherited value that we
>>> may want to show here?
>> Some other cases:
>>
>> 1) MTU: there should be something that the device needs to report if
>> _F_MTU is negotiated even if it is not provisioned from netlink.
> I am not sure I understand the ask here. Noted the QEMU argument has
> to offer host_mtu=X with the maximum MTU value for guest to use (and
> applied as the initial MTU config during virtio-net probing for Linux
> driver),
Adding Cindy.
I think it's a known issue that we need to do a sanity check to make sure
the CLI parameters match what is provisioned from netlink.
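Roughly, such a check could look like the user-space sketch below. This is not QEMU or kernel code: struct prov_net, PROV_MTU and check_cli_mtu() are hypothetical names, and treating "match" as strict equality is an assumption; a real check may need to be more lenient.

```c
#include <assert.h>
#include <errno.h>
#include <stdint.h>

#define PROV_MTU (1u << 0) /* hypothetical "MTU was provisioned" bit */

/* Illustrative stand-in for the net config provisioned via netlink. */
struct prov_net {
	uint32_t mask;
	uint16_t mtu;
};

/*
 * Return 0 if the CLI-side MTU agrees with what was provisioned,
 * -EINVAL on mismatch. An unprovisioned MTU imposes no constraint here.
 */
static int check_cli_mtu(const struct prov_net *p, uint16_t cli_mtu)
{
	assert(p);

	if (!(p->mask & PROV_MTU))
		return 0;

	return cli_mtu == p->mtu ? 0 : -EINVAL;
}
```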
> and the way to get the parent device MTU and whether that's relevant
> to vdpa device's MTU is very vendor specific.
So I think the max MTU of the parent should be equal to the max MTU of the
vDPA device.
> I think we would need new attribute(s) in the mgmtdev level to support
> what you want here?
Not sure, but what I want to ask is: suppose we provision the MTU feature
but without a max MTU value, do we need to report the initial max MTU here?
>
>> 2) device_features: if device_features is not provisioned, we should
>> still report it via netlink here
> Not the way I expected it, but with Lingshan's series to expose fields
> out of FEATURES_OK, the device_features is now reported through 'vdpa
> dev config show' regardless being specified or not, if I am not mistaken?
Yes.
>
> Currently we export the config attributes upon vdpa creation under the
> "initial_config" key. If we want to expose more default values
> inherited from mgmtdev, I think we can wrap up these default values
> under another key "inherited_config" to display in 'vdpa dev show'
> output. Does it fit what you have in mind?
I wonder if it's better to merge those two, or are there any advantages
to splitting them?
>
>> or do you mean the mgmt can assume it
>> should be the same as mgmtdev. Anyhow if we don't show device_features
>> if it is not provisioned, it will complicate the mgmt software.
> Yes, as I said earlier, since the device_features attr getting added
> to the 'vdpa dev config show' command, this divergence started to
> complicate mgmt software already.
>
> Thanks,
Thanks
> -Siwei
>> Thanks
>>
>>> Thanks,
>>> -Siwei
>>>
>>>
>>>> Thanks
>>>>
>>>>> +
>>>>> + return 0;
>>>>> +}
>>>>> +
>>>>> +static int
>>>>> vdpa_dev_fill(struct vdpa_device *vdev, struct sk_buff *msg, u32 portid, u32 seq,
>>>>> int flags, struct netlink_ext_ack *extack)
>>>>> {
>>>>> @@ -715,6 +750,10 @@ static int vdpa_nl_cmd_dev_del_set_doit(struct sk_buff *skb, struct genl_info *i
>>>>> if (nla_put_u16(msg, VDPA_ATTR_DEV_MIN_VQ_SIZE, min_vq_size))
>>>>> goto msg_err;
>>>>>
>>>>> + err = vdpa_dev_initcfg_fill(vdev, msg, device_id);
>>>>> + if (err)
>>>>> + goto msg_err;
>>>>> +
>>>>> genlmsg_end(msg, hdr);
>>>>> return 0;
>>>>>
>>>>> --
>>>>> 1.8.3.1
>>>>>
>