All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Michael S. Tsirkin" <mst@redhat.com>
To: Eugenio Perez Martin <eperezma@redhat.com>
Cc: Maxime Coquelin <mcoqueli@redhat.com>,
	Yongji Xie <xieyongji@bytedance.com>,
	virtualization@lists.linux.dev, linux-kernel@vger.kernel.org,
	Xuan Zhuo <xuanzhuo@linux.alibaba.com>,
	Dragos Tatulea DE <dtatulea@nvidia.com>,
	jasowang@redhat.com
Subject: Re: [RFC 1/2] virtio_net: timeout control virtqueue commands
Date: Wed, 22 Oct 2025 07:43:20 -0400	[thread overview]
Message-ID: <20251022073231-mutt-send-email-mst@kernel.org> (raw)
In-Reply-To: <CAJaqyWdRN3C1nOHQjWroBPWKjc5efNfVbecpruL=Cgsk6i1FXg@mail.gmail.com>

On Wed, Oct 22, 2025 at 12:50:53PM +0200, Eugenio Perez Martin wrote:
> Let me switch to MQ as I think it illustrates the point better.
> 
> IIUC the workflow:
> a) virtio-net sends MQ_VQ_PAIRS_SET 2 to the device
> b) VDUSE CVQ sends ok to the virtio-net driver
> c) VDUSE CVQ sends the command to the VDUSE device
> d) Now the virtio-net driver sends virtio-net sends MQ_VQ_PAIRS_SET 1
> e) VDUSE CVQ sends ok to the virtio-net driver
> 
> The device didn't process the MQ_VQ_PAIRS_SET 1 command at this point,
> so it potentially uses the second rx queue. But, by the standard:
> 
> The device MUST NOT queue packets on receive queues greater than
> virtqueue_pairs once it has placed the VIRTIO_NET_CTRL_MQ_VQ_PAIRS_SET
> command in a used buffer.
> 
> So the driver does not expect rx buffers on that queue at all. From
> the driver's POV, the device is invalid, and it could mark it as
> broken.

ok intresting. Note that if userspace processes vqs it should process
cvq too. I don't know what to do in this case yet, I'm going on
vacation, let me ponder this a bit.


> And, what's worse, how to handle it if the device now replies with
> VIRTIO_NET_ERR to the VDUSE CVQ?

this part does not bother me much. break it, probably.

> > > If we wait for the device to reply, we're in the
> > > same situation regarding the RTNL.
> > >
> > > Now we receive a new state (A, B, E). We haven't sent the (A, B, D),
> > > so it is good to just replace the (A, B, D) with that. and send it
> > > when (A, B, C) is completed with either success or failure.
> > >
> > > 2) VQ_PAIRS_SET
> > >
> > > The driver starts with 1 vq pair. Now the driver sets 3 vq pairs, and
> > > the VDUSE CVQ forwards the command. The driver still thinks that it is
> > > using 1 vq pair. I can store that the driver request was 3, and it is
> > > still in-flight. Now the timeout occurs, so the VDUSE device returns
> > > fail to the driver, and the driver frees the vq regions etc. After
> > > that, the device now replies OK. The memory that was sent as the new
> > > vqs avail ring and descriptor ring now contains garbage, and it could
> > > happen that the device start overriding unrelated memory.
> > >
> > > Not even VQ_RESET protects against it as there is still a window
> > > between the CMD set and the VQ reset.
> >
> > Timeouts should be up to userspace. If userspace times out
> > and then gets confused, kernel is not to blame.
> >
> >
> 
> I meant the virtio-net driver will be confused.


  reply	other threads:[~2025-10-22 11:43 UTC|newest]

Thread overview: 49+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2025-10-07 13:06 [RFC 0/2] Lift restriction about VDUSE net devices with CVQ Eugenio Pérez
2025-10-07 13:06 ` [RFC 1/2] virtio_net: timeout control virtqueue commands Eugenio Pérez
2025-10-11  7:44   ` Jason Wang
2025-10-14  7:30     ` Eugenio Perez Martin
2025-10-14  8:29   ` Michael S. Tsirkin
2025-10-14  9:14     ` Maxime Coquelin
2025-10-14  9:25       ` Michael S. Tsirkin
2025-10-14 10:21         ` Maxime Coquelin
2025-10-15  4:44           ` Jason Wang
2025-10-15  6:07             ` Michael S. Tsirkin
2025-10-15  6:08         ` Eugenio Perez Martin
2025-10-15  6:33           ` Michael S. Tsirkin
2025-10-15  6:52             ` Eugenio Perez Martin
2025-10-15  7:04               ` Michael S. Tsirkin
2025-10-15  7:45                 ` Eugenio Perez Martin
2025-10-15  8:03                   ` Maxime Coquelin
2025-10-15  8:09                     ` Michael S. Tsirkin
2025-10-15  9:16                       ` Maxime Coquelin
2025-10-15 10:36                       ` Eugenio Perez Martin
2025-10-16  5:39                         ` Jason Wang
2025-10-16  5:45                           ` Michael S. Tsirkin
2025-10-16  6:03                             ` Jason Wang
2025-10-16  6:22                               ` Michael S. Tsirkin
2025-10-16  6:25                                 ` Eugenio Perez Martin
2025-10-17  6:36                                   ` Eugenio Perez Martin
2025-10-17  6:39                                     ` Michael S. Tsirkin
2025-10-17  7:21                                       ` Eugenio Perez Martin
2025-10-22  9:46                                         ` Eugenio Perez Martin
2025-10-22 10:06                                         ` Michael S. Tsirkin
2025-10-22 10:09                         ` Michael S. Tsirkin
2025-10-22 10:50                           ` Eugenio Perez Martin
2025-10-22 11:43                             ` Michael S. Tsirkin [this message]
2025-10-22 12:55                               ` Eugenio Perez Martin
2025-10-28 14:09                                 ` Michael S. Tsirkin
2025-10-28 14:37                                   ` Eugenio Perez Martin
2025-10-28 14:42                                     ` Michael S. Tsirkin
2025-10-28 14:57                                       ` Eugenio Perez Martin
2025-10-29  0:36                                         ` Jason Wang
2025-11-05  9:02                                         ` Eugenio Perez Martin
2025-11-09 21:46                                           ` Michael S. Tsirkin
2026-03-13 11:31                                             ` Eugenio Perez Martin
2026-03-13 11:32                                               ` Eugenio Perez Martin
2026-03-13 12:06                                                 ` Michael S. Tsirkin
2026-03-16  9:26                                                   ` Eugenio Perez Martin
2025-10-07 13:06 ` [RFC 2/2] vduse: lift restriction about net devices with CVQ Eugenio Pérez
2025-10-09 13:14   ` Maxime Coquelin
2025-10-15  6:11     ` Eugenio Perez Martin
2025-10-14  8:31   ` Michael S. Tsirkin
2025-10-15  6:25     ` Eugenio Perez Martin

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20251022073231-mutt-send-email-mst@kernel.org \
    --to=mst@redhat.com \
    --cc=dtatulea@nvidia.com \
    --cc=eperezma@redhat.com \
    --cc=jasowang@redhat.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mcoqueli@redhat.com \
    --cc=virtualization@lists.linux.dev \
    --cc=xieyongji@bytedance.com \
    --cc=xuanzhuo@linux.alibaba.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.