public inbox for virtio-dev@lists.linux.dev
From: Xuan Zhuo <xuanzhuo@linux.alibaba.com>
To: Jason Wang <jasowang@redhat.com>
Cc: virtio-dev@lists.oasis-open.org, mst@redhat.com,
	parav@nvidia.com, virtio-comment@lists.oasis-open.org, "Zhu,
	Lingshan" <lingshan.zhu@intel.com>
Subject: [virtio-dev] Re: [virtio-comment] [RFC PATCH] admin-queue: bind the group member to the device
Date: Tue, 27 Jun 2023 18:50:46 +0800
Message-ID: <1687863046.3264692-9-xuanzhuo@linux.alibaba.com>
In-Reply-To: <CACGkMEs=2HJ=Bg5Xk1GYZAOKpNLpf5EUrkNEkEOKU4hSbXC=mw@mail.gmail.com>

On Tue, 27 Jun 2023 17:00:06 +0800, Jason Wang <jasowang@redhat.com> wrote:
> On Tue, Jun 27, 2023 at 4:28 PM Xuan Zhuo <xuanzhuo@linux.alibaba.com> wrote:
> >
> >
> > Thanks Parav for pointing it out. We may have some gaps regarding the case.
> >
> > Let me introduce our case, which I think is simple and should be easy to
> > understand.
> >
> > First, the user (customer) purchased a bare metal machine.
> >
> > ## Bare metal machine
> >
> > Let me briefly explain the characteristics of a bare metal machine. It is a
> > physical machine, not a virtual machine. It differs from an ordinary
> > physical machine in that its PCI bus is connected to a device similar to a
> > DPU. This DPU provides devices such as virtio-blk/net to the host through PCI.
> > These devices are managed by the vendor, and must be purchased and created
> > on the vendor's management platform.
> >
> > ## DPU
> >
> > There is a software implementation in the DPU that responds to PCI
> > operations. But as mentioned above, resources such as network cards must be
> > purchased and created before they can exist. So users can create a VF, which is
> > just a PCI-level operation, but there may not be a corresponding backend.
> >
> > ## Management Platform
> >
> > The creation and configuration of devices is realized on the management
> > platform.
> >
> > After the user completes the purchase on the management platform (this is an
> > independent platform provided by the vendor and has nothing to do with
> > virtio), there will be a corresponding device implementation in the DPU.
> > This includes some user configuration, available bandwidth resources and other
> > information.
> >
> > ## Usage
> >
> > Since the user is directly on the host, the user can create VMs and pass a PF
> > or VF through to a VM. Or the user can create a large number of Docker
> > containers, each of which uses a separate virtio-net device for performance.
> >
> > The reason users use VFs is that we need a large number of virtio-net
> > devices. This number reaches 1k+.
> >
> > Based on this scenario, we need to bind a VF to a backend device, because we
> > cannot automatically create the virtio-net backend device when
> > the user creates a VF.
> >
> > ## Migration
> >
> > In addition, let's consider another scenario: migration. If a VM is migrated
> > from another host, its corresponding virtio device is of course also migrated to
> > the DPU. At this point, our newly created VF can only be used by the VM after it
> > is bound to the migrated device. We do not want this VF to be a brand new
> > device.
> >
> > ## Abstraction
> >
> > So, this is how I understand the process of creating a VF:
> >
> > 1. Create a PCI VF. At this time there may be no backend virtio device, or there
> >     is only a default backend that does not fully meet our expectations.
> > 2. Create a device or migrate a device.
> > 3. Bind the backend virtio device to the VF.
>
> 3) should come before 2)?
>
> Who is going to do 3) btw, is it the user? If yes, for example, if a
> user wants another 4 queue virtio-net devices, after purchase, how
> does the user know its id?

The user gets the id from the management platform.

>
> >
> > In most scenarios, the first step may be enough. We can do some fine-tuning on
> > this default device, such as modifying its MAC. In the future, we can use the admin
> > queue to modify its MSI-X vectors and other configuration.
> >
> > But we should also allow binding a backend virtio device to a specific VF. This is
> > useful for live migration and for virtio devices with special configurations.
>
> All of these could be addressed if a dynamic provisioning model is
> implemented (SIOV or transport virtqueue). Trying to have a workaround
> in SR-IOV might be tricky.


An SR-IOV VF is a native PCI device; that is its advantage.


>
> >
> > Virtio itself is designed in two layers, and by nature virtio should allow
> > switching the transport layer. This is our advantage.
>
> Is it not about switching the transport layer but about binding/unbinding
> virtio devices to VFs?

YES.

>
> Is a new capability or similar admin cmd sufficient in this case?

Either is OK.


>
> struct virtio_pci_bind_cap {
>         struct virtio_pci_cap cap;
>         u16 bind; // virtio_device_id
>         u16 unbind; // virtio_device_id
> };

Do you mean that the "bind" and "unbind" fields are writable?
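
To make the question concrete, here is a minimal sketch in C of what writable
fields could mean on the device side. Only the two struct layouts come from
the thread and the virtio PCI transport; everything else is an assumption for
illustration. `vf_state`, `NO_BACKEND`, `vf_write_bind()` and
`vf_write_unbind()` are hypothetical names, and the rule that a VF holds at
most one backend at a time is a guess, not something stated in this thread.

```c
#include <stdint.h>

/* Generic virtio PCI capability header, as laid out in the virtio spec.
 * offset/length are little-endian on the wire; plain integers here. */
struct virtio_pci_cap {
    uint8_t  cap_vndr;
    uint8_t  cap_next;
    uint8_t  cap_len;
    uint8_t  cfg_type;
    uint8_t  bar;
    uint8_t  id;
    uint8_t  padding[2];
    uint32_t offset;
    uint32_t length;
};

/* Capability proposed above in this thread. */
struct virtio_pci_bind_cap {
    struct virtio_pci_cap cap;
    uint16_t bind;   /* virtio_device_id */
    uint16_t unbind; /* virtio_device_id */
};

#define NO_BACKEND 0 /* hypothetical: id 0 means "nothing bound" */

/* Hypothetical per-VF state kept by the device implementation. */
struct vf_state {
    uint16_t bound_id;
};

/* Assumed semantics: writing an id to "bind" attaches that backend,
 * and fails if the VF already has one. Returns 0 on success, -1 on error. */
static int vf_write_bind(struct vf_state *vf, uint16_t id)
{
    if (id == NO_BACKEND || vf->bound_id != NO_BACKEND)
        return -1;
    vf->bound_id = id;
    return 0;
}

/* Assumed semantics: writing an id to "unbind" detaches that backend,
 * and fails if a different backend (or none) is bound. */
static int vf_write_unbind(struct vf_state *vf, uint16_t id)
{
    if (id == NO_BACKEND || vf->bound_id != id)
        return -1;
    vf->bound_id = NO_BACKEND;
    return 0;
}
```

If the fields are meant to be read-only status instead, none of the above
applies and an admin command would be the only way to change the binding.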

Thanks.

>
> Thanks
>
> >
> > ## About the identity
> >
> > In this patch, I used a vendor-specific id. The purpose is to
> > be compatible with all devices. In the network scenario, it is actually possible
> > to use a MAC address.
> >
> > Perhaps we could introduce a standard id for each device/driver; I think this may be
> > more in line with the conventions of the virtio spec.
> >
> > Thanks.
> >
> > On Mon, 26 Jun 2023 14:22:10 +0800, Xuan Zhuo <xuanzhuo@linux.alibaba.com> wrote:
> > > The VFs of the SR-IOV are created by the user inside the guest OS, so the virtio
> > > devices don't know about these VFs. Because each VF may be assigned a different role
> > > by the user, the virtio device cannot pick a VF to bind to at random.
> > > So only the user knows how to bind the virtio devices to the VFs.
> > > On the other hand, generally the virtio devices are not created by the user
> > > inside the guest OS. This requires some management platform to participate.
> > >
> > > So the usage of this command:
> > > 1. The user purchases a virtio network card on the management platform,
> > >    and sets the ip, queue number, etc. The user obtains the identity of
> > >    the network card.
> > > 2. The user creates a VF with echo 8 > sriov_numvfs
> > > 3. The user binds the network card to a VF by its identity, using the command
> > >    from this patch
> > >
> > > Signed-off-by: Xuan Zhuo <xuanzhuo@linux.alibaba.com>
> > > ---
> > >  admin.tex | 41 ++++++++++++++++++++++++++++++++++++++++-
> > >  1 file changed, 40 insertions(+), 1 deletion(-)
> > >
> > > diff --git a/admin.tex b/admin.tex
> > > index 2efd4d7..64d0667 100644
> > > --- a/admin.tex
> > > +++ b/admin.tex
> > > @@ -115,7 +115,8 @@ \subsection{Group administration commands}\label{sec:Basic Facilities of a Virti
> > >  \hline \hline
> > >  0x0000 & VIRTIO_ADMIN_CMD_LIST_QUERY & Provides to driver list of commands supported for this group type    \\
> > >  0x0001 & VIRTIO_ADMIN_CMD_LIST_USE & Provides to device list of commands used for this group type \\
> > > -0x0002 - 0x7FFF & - & Commands using \field{struct virtio_admin_cmd}    \\
> > > +0x0002 & VIRTIO_ADMIN_CMD_BIND_DEVICE & Bind the device to one group member \\
> > > +0x0003 - 0x7FFF & - & Commands using \field{struct virtio_admin_cmd}    \\
> > >  \hline
> > >  0x8000 - 0xFFFF & - & Reserved for future commands (possibly using a different structure)    \\
> > >  \hline
> > > @@ -429,6 +430,44 @@ \subsection{Group administration commands}\label{sec:Basic Facilities of a Virti
> > >  \field{VF Enable} refer to registers within the SR-IOV Extended
> > >  Capability as specified by \hyperref[intro:PCIe]{[PCIe]}.
> > >
> > > +\subsubsection{Bind the device for member}
> > > +
> > > +The VFs of the SR-IOV are created by the user inside the guest OS, so the virtio
> > > +devices don't know about these VFs. Because each VF may be assigned a different role
> > > +by the user, the virtio device cannot pick a VF to bind to at random.
> > > +So only the user knows how to bind the virtio devices to the VFs.
> > > +On the other hand, generally the virtio devices are not created by the user
> > > +inside the guest OS. This requires some management platform to participate.
> > > +
> > > +So we introduce a new admin queue command to bind the VFs and the virtio
> > > +devices.
> > > +
> > > +\begin{lstlisting}
> > > +struct virtio_admin_cmd_bind {
> > > +    u64 identity;
> > > +};
> > > +\end{lstlisting}
> > > +
> > > +The user obtains the \field{identity} from the management platform, which is
> > > +outside the scope of this spec.
> > > +
> > > +\drivernormative{\paragraph}{Group administration commands}{Basic Facilities of a Virtio Device / Device groups / Group administration commands / Bind the device for member}
> > > +
> > > +VIRTIO_ADMIN_CMD_BIND_DEVICE requires that the \field{group_member_id} MUST be set.
> > > +
> > > +The \field{identity} is passed by the user. It is the identity of the virtio
> > > +device.
> > > +
> > > +\devicenormative{\paragraph}{Group administration commands}{Basic Facilities of a Virtio Device / Device groups / Group administration commands / Bind the device for member}
> > > +
> > > +Every device MUST have one unique \field{identity} in the host.
> > > +
> > > +If the PF device can not find the device by the \field{identity},
> > > +the \field{status} MUST be set to VIRTIO_ADMIN_STATUS_EINVAL.
> > > +
> > > +If the device is found by the \field{identity}, the device MUST work as the
> > > +device of this group member specified by the \field{group_member_id}.
> > > +
> > >  \section{Administration Virtqueues}\label{sec:Basic Facilities of a Virtio Device / Administration Virtqueues}
> > >
> > >  An administration virtqueue of an owner device is used to submit
> > > --
> > > 2.32.0.3.g01195cf9f
> > >
> > >
> >
>
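
For reference, the device-side rules in the quoted patch could be sketched
roughly as follows. This is only an illustration under stated assumptions:
`struct backend`, `pf_handle_bind()` and the lookup table are hypothetical,
the numeric value used for VIRTIO_ADMIN_STATUS_EINVAL is assumed rather than
taken from the patch, and the patch does not say what happens when a backend
is already bound, so the sketch simply records the latest binding.

```c
#include <stddef.h>
#include <stdint.h>

#define VIRTIO_ADMIN_CMD_BIND_DEVICE 0x0002 /* opcode proposed in the patch */
#define VIRTIO_ADMIN_STATUS_OK       0
#define VIRTIO_ADMIN_STATUS_EINVAL   22     /* assumed value */

/* Command-specific data from the patch. */
struct virtio_admin_cmd_bind {
    uint64_t identity;
};

/* Hypothetical record of a backend virtio device known to the PF. */
struct backend {
    uint64_t identity;     /* unique per host, per the patch */
    uint64_t bound_member; /* group_member_id, 0 = not bound (assumption) */
};

/* Handle VIRTIO_ADMIN_CMD_BIND_DEVICE on the PF (owner device).
 * Returns the value to place in the command's status field. */
static uint16_t pf_handle_bind(struct backend *tbl, size_t n,
                               uint64_t group_member_id,
                               const struct virtio_admin_cmd_bind *cmd)
{
    /* The driver MUST set group_member_id; treat 0 as "not set". */
    if (group_member_id == 0)
        return VIRTIO_ADMIN_STATUS_EINVAL;

    for (size_t i = 0; i < n; i++) {
        if (tbl[i].identity == cmd->identity) {
            /* Found: this backend now works as the device of that member. */
            tbl[i].bound_member = group_member_id;
            return VIRTIO_ADMIN_STATUS_OK;
        }
    }

    /* No device with this identity: the patch requires EINVAL. */
    return VIRTIO_ADMIN_STATUS_EINVAL;
}
```

In the usage flow from the patch description, the identity passed here is the
one the user obtained from the management platform in step 1, and the
group_member_id names one of the VFs created in step 2.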
