All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Michael S. Tsirkin" <mst@redhat.com>
To: Jason Wang <jasowang@redhat.com>
Cc: Heng Qi <hengqi@linux.alibaba.com>,
	virtio-comment@lists.oasis-open.org,
	Parav Pandit <parav@nvidia.com>,
	Xuan Zhuo <xuanzhuo@linux.alibaba.com>
Subject: Re: [virtio-comment] [PATCH v2] virtio-net: support distinguishing between partial and full checksum
Date: Thu, 2 Nov 2023 02:50:03 -0400	[thread overview]
Message-ID: <20231102024425-mutt-send-email-mst@kernel.org> (raw)
In-Reply-To: <CACGkMEucwwx2p1Y-3DN3yBAA6_Mu6pdmKxpCj8pDbD6YxjvFPg@mail.gmail.com>

On Thu, Nov 02, 2023 at 12:40:03PM +0800, Jason Wang wrote:
> On Wed, Nov 1, 2023 at 1:37 PM Michael S. Tsirkin <mst@redhat.com> wrote:
> >
> > On Wed, Nov 01, 2023 at 12:16:23PM +0800, Jason Wang wrote:
> > > On Sat, Oct 28, 2023 at 10:36 AM Heng Qi <hengqi@linux.alibaba.com> wrote:
> > > >
> > > >
> > > >
> > > > 在 2023/10/27 下午3:39, Michael S. Tsirkin 写道:
> > > > > On Thu, Oct 19, 2023 at 02:17:20PM +0800, Heng Qi wrote:
> > > > >> virtio-net works in a virtualized system and is somewhat different from
> > > > >> physical nics. One of the differences is that to save virtio device
> > > > >> resources, rx may receive packets with partial checksum. However, XDP may
> > > > >> cause partially checksummed packets to be dropped. So XDP loading conflicts
> > > > >> with the feature VIRTIO_NET_F_GUEST_CSUM.
> > > > >>
> > > > >> This patch lets the device to supply fully checksummed packets to the driver.
> > > > >> Then XDP can coexist with VIRTIO_NET_F_GUEST_CSUM to enjoy the benefits of
> > > > >> device verification checksum.
> > > > >>
> > > > >> In addition, implementation of some performant devices do not generate
> > > > >> partially checksummed packets, but the standard driver still need to clear
> > > > >> VIRTIO_NET_F_GUEST_CSUM when loading XDP. If these devices enable the
> > > > >> full checksum offloading, then the driver can load XDP without clearing
> > > > >> VIRTIO_NET_F_GUEST_CSUM.
> > > > >>
> > > > >> A new feature bit VIRTIO_NET_F_GUEST_FULL_CSUM is added to solve the above
> > > > >> situation, which provides the driver with configurable receive full checksum
> > > > >> offload. If the offload is enabled, then the device must supply fully
> > > > >> checksummed packets to the driver.
> > > > >>
> > > > >> Use case example:
> > > > >> If VIRTIO_NET_F_GUEST_FULL_CSUM is negotiated and receive full checksum
> > > > >> offload is enabled, after XDP processes a packet with full checksum, the
> > > > >> VIRTIO_NET_HDR_F_DATA_VALID bit is still retained, resulting in the stack
> > > > >> not needing to validate the checksum again. This is useful for guests:
> > > > >>    1. Bring the driver advantages such as cpu savings.
> > > > >>    2. For devices that do not generate partially checksummed packets themselves,
> > > > >>       XDP can be loaded in the driver without modifying the hardware behavior.
> > > > >>
> > > > >> Several solutions have been discussed in the previous proposal[1].
> > > > >> After historical discussion, we have tried the method proposed by Jason[2],
> > > > >> but some complex scenarios and challenges are difficult to deal with.
> > > > >> We now return to the method suggested in [1].
> > > > >>
> > > > >> [1] https://lists.oasis-open.org/archives/virtio-dev/202305/msg00291.html
> > > > >> [2] https://lore.kernel.org/all/20230628030506.2213-1-hengqi@linux.alibaba.com/
> > > > >>
> > > > >> Signed-off-by: Heng Qi <hengqi@linux.alibaba.com>
> > > > >> Reviewed-by: Xuan Zhuo <xuanzhuo@linux.alibaba.com>
> > > > >> ---
> > > > >> v1->v2:
> > > > >>      1. Modify full checksum functionality as a configurable offload
> > > > >>         that is initially turned off. @Jason
> > > > >>
> > > > >>   device-types/net/description.tex | 54 ++++++++++++++++++++++++++++----
> > > > >>   1 file changed, 48 insertions(+), 6 deletions(-)
> > > > >>
> > > > >> diff --git a/device-types/net/description.tex b/device-types/net/description.tex
> > > > >> index 76585b0..3c34f27 100644
> > > > >> --- a/device-types/net/description.tex
> > > > >> +++ b/device-types/net/description.tex
> > > > >> @@ -88,6 +88,8 @@ \subsection{Feature bits}\label{sec:Device Types / Network Device / Feature bits
> > > > >>   \item[VIRTIO_NET_F_CTRL_MAC_ADDR(23)] Set MAC address through control
> > > > >>       channel.
> > > > >>
> > > > >> +\item[VIRTIO_NET_F_GUEST_FULL_CSUM (50)] Driver handles packets with full checksum.
> > > > >> +
> > > > >>   \item[VIRTIO_NET_F_HASH_TUNNEL(51)] Device supports inner header hash for encapsulated packets.
> > > > >>
> > > > >>   \item[VIRTIO_NET_F_VQ_NOTF_COAL(52)] Device supports virtqueue notification coalescing.
> > > > >> @@ -133,6 +135,7 @@ \subsubsection{Feature bit requirements}\label{sec:Device Types / Network Device
> > > > >>   \item[VIRTIO_NET_F_GUEST_UFO] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > >>   \item[VIRTIO_NET_F_GUEST_USO4] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > >>   \item[VIRTIO_NET_F_GUEST_USO6] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > >> +\item[VIRTIO_NET_F_GUEST_FULL_CSUM] Requires VIRTIO_NET_F_GUEST_CSUM and VIRTIO_NET_F_CTRL_GUEST_OFFLOADS.
> > > > >>
> > > > >>   \item[VIRTIO_NET_F_HOST_TSO4] Requires VIRTIO_NET_F_CSUM.
> > > > >>   \item[VIRTIO_NET_F_HOST_TSO6] Requires VIRTIO_NET_F_CSUM.
> > > > > What about all of these:
> > > > >
> > > > > device-types/net/description.tex:\item[VIRTIO_NET_F_GUEST_TSO4] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > > device-types/net/description.tex:\item[VIRTIO_NET_F_GUEST_TSO6] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > > device-types/net/description.tex:\item[VIRTIO_NET_F_GUEST_UFO] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > > device-types/net/description.tex:\item[VIRTIO_NET_F_GUEST_USO4] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > > device-types/net/description.tex:\item[VIRTIO_NET_F_GUEST_USO6] Requires VIRTIO_NET_F_GUEST_CSUM.
> > > > >
> > > > >
> > > > >
> > > > > can TSO/UFO/USO work with VIRTIO_NET_F_GUEST_FULL_CSUM as opposed to VIRTIO_NET_F_GUEST_CSUM?
> > > >
> > > > Both GUEST_FULL_CSUM and GUEST_CSUM can work with GUEST_TSO/USO/UFO.
> > >
> > > Yes. For software devices I guess it will have a lot of performance
> > > penalty. So it should be disabled by default anyhow. The idea is to
> > > delay the csum as late as possible.
> >
> > But for hardware it's actually better.
> 
> I can't think of a case where it might be better than XDP.

Of course CHECKSUM_COMPLETE is better than CHECKSUM_PARTIAL if you can
support it: you don't even need to look at csum_start + csum_offset.
And the hardware doesn't need to parse L3/L4 headers to implement
CHECKSUM_COMPLETE.


> Most userspace doesn't care about the checksum though.
> 
> > Maybe we need a flag
> > to say which offloads are expensive?
> >
> 
> That exposes some device details which seem not good (e.g we may want
> to do migration among hardware and software).
> 
> Thanks

If you do then things will be less well tuned on one of the migration
ends but then that is by design, isn't it?

-- 
MST


This publicly archived list offers a means to provide input to the
OASIS Virtual I/O Device (VIRTIO) TC.

In order to verify user consent to the Feedback License terms and
to minimize spam in the list archive, subscription is required
before posting.

Subscribe: virtio-comment-subscribe@lists.oasis-open.org
Unsubscribe: virtio-comment-unsubscribe@lists.oasis-open.org
List help: virtio-comment-help@lists.oasis-open.org
List archive: https://lists.oasis-open.org/archives/virtio-comment/
Feedback License: https://www.oasis-open.org/who/ipr/feedback_license.pdf
List Guidelines: https://www.oasis-open.org/policies-guidelines/mailing-lists
Committee: https://www.oasis-open.org/committees/virtio/
Join OASIS: https://www.oasis-open.org/join/


  reply	other threads:[~2023-11-02  6:50 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-10-19  6:17 [virtio-comment] [PATCH v2] virtio-net: support distinguishing between partial and full checksum Heng Qi
2023-10-27  2:35 ` Heng Qi
2023-10-27  7:39 ` Michael S. Tsirkin
2023-10-28  1:53   ` Xuan Zhuo
2023-10-28  2:36   ` Heng Qi
2023-11-01  4:16     ` Jason Wang
2023-11-01  4:59       ` Heng Qi
2023-11-02  5:30         ` Jason Wang
2023-11-02  6:59           ` Michael S. Tsirkin
2023-11-06  6:51             ` Heng Qi
2023-11-01  5:37       ` Michael S. Tsirkin
2023-11-01  6:46         ` Heng Qi
2023-11-02  4:40         ` Jason Wang
2023-11-02  6:50           ` Michael S. Tsirkin [this message]
2023-11-09  3:55             ` Jason Wang
2023-11-09  8:01               ` Michael S. Tsirkin
2023-11-09  8:42                 ` Heng Qi

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20231102024425-mutt-send-email-mst@kernel.org \
    --to=mst@redhat.com \
    --cc=hengqi@linux.alibaba.com \
    --cc=jasowang@redhat.com \
    --cc=parav@nvidia.com \
    --cc=virtio-comment@lists.oasis-open.org \
    --cc=xuanzhuo@linux.alibaba.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.