From: "Michael S. Tsirkin" <mst@redhat.com>
To: Stefano Garzarella <sgarzare@redhat.com>
Cc: linux-kernel@vger.kernel.org, ShuangYu <shuangyu@yunyoo.cc>,
"Stefan Hajnoczi" <stefanha@redhat.com>,
"Jason Wang" <jasowang@redhat.com>,
"Eugenio Pérez" <eperezma@redhat.com>,
kvm@vger.kernel.org, virtualization@lists.linux.dev,
netdev@vger.kernel.org
Subject: Re: [PATCH RFC] vhost: fix vhost_get_avail_idx for a non empty ring
Date: Mon, 2 Mar 2026 10:12:11 -0500 [thread overview]
Message-ID: <20260302101125-mutt-send-email-mst@kernel.org> (raw)
In-Reply-To: <aaWTjPXDqhMZlwLr@sgarzare-redhat>
On Mon, Mar 02, 2026 at 03:30:53PM +0100, Stefano Garzarella wrote:
> On Mon, Mar 02, 2026 at 03:51:49AM -0500, Michael S. Tsirkin wrote:
> > vhost_get_avail_idx is supposed to report whether it has updated
> > vq->avail_idx. Instead, it returns whether all entries have been
> > consumed, which is usually the same. But not always - in
> > drivers/vhost/net.c and when mergeable buffers have been enabled, the
> > driver checks whether the combined entries are big enough to store an
> > incoming packet. If not, the driver re-enables notifications with
> > available entries still in the ring. The incorrect return value from
> > vhost_get_avail_idx propagates through vhost_enable_notify and causes
> > the host to livelock if the guest is not making progress, as vhost will
> > immediately disable notifications and retry using the available entries.
>
> Here I'd add something like this just to make it clear the full picture,
> because I spent quite some time to understand how it was related to the
> Fixes tag (which I agree is the right one to use).
>
> This goes back to commit d3bb267bbdcb ("vhost: cache avail index in
> vhost_enable_notify()") which changed vhost_enable_notify() to compare
> the freshly read avail index against vq->last_avail_idx instead of the
> previously cached vq->avail_idx. Commit 7ad472397667 ("vhost: move
> smp_rmb() into vhost_get_avail_idx()") then carried over the same
> comparison when refactoring vhost_enable_notify() to call the unified
> vhost_get_avail_idx().
Indeed.
> >
> > The obvious fix is to make vhost_get_avail_idx do what the comment
> > says it does and report whether new entries have been added.
> >
> > Reported-by: ShuangYu <shuangyu@yunyoo.cc>
> > Fixes: d3bb267bbdcb ("vhost: cache avail index in vhost_enable_notify()")
> > Cc: Stefano Garzarella <sgarzare@redhat.com>
> > Cc: Stefan Hajnoczi <stefanha@redhat.com>
> > Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
> > ---
> >
> > Lightly tested, posting early to simplify testing for the reporter.
>
> Tested with vhost-vsock and I didn't see any issue.
>
> Thanks!
>
> Reviewed-by: Stefano Garzarella <sgarzare@redhat.com>
>
> >
> > drivers/vhost/vhost.c | 11 +++++++----
> > 1 file changed, 7 insertions(+), 4 deletions(-)
> >
> > diff --git a/drivers/vhost/vhost.c b/drivers/vhost/vhost.c
> > index 2f2c45d20883..db329a6f6145 100644
> > --- a/drivers/vhost/vhost.c
> > +++ b/drivers/vhost/vhost.c
> > @@ -1522,6 +1522,7 @@ static void vhost_dev_unlock_vqs(struct vhost_dev *d)
> > static inline int vhost_get_avail_idx(struct vhost_virtqueue *vq)
> > {
> > __virtio16 idx;
> > + u16 avail_idx;
> > int r;
> >
> > r = vhost_get_avail(vq, idx, &vq->avail->idx);
> > @@ -1532,17 +1533,19 @@ static inline int vhost_get_avail_idx(struct vhost_virtqueue *vq)
> > }
> >
> > /* Check it isn't doing very strange thing with available indexes */
> > - vq->avail_idx = vhost16_to_cpu(vq, idx);
> > - if (unlikely((u16)(vq->avail_idx - vq->last_avail_idx) > vq->num)) {
> > + avail_idx = vhost16_to_cpu(vq, idx);
> > + if (unlikely((u16)(avail_idx - vq->last_avail_idx) > vq->num)) {
> > vq_err(vq, "Invalid available index change from %u to %u",
> > - vq->last_avail_idx, vq->avail_idx);
> > + vq->last_avail_idx, avail_idx);
> > return -EINVAL;
> > }
> >
> > /* We're done if there is nothing new */
> > - if (vq->avail_idx == vq->last_avail_idx)
> > + if (avail_idx == vq->avail_idx)
> > return 0;
> >
> > + vq->avail_idx = avail_idx;
> > +
> > /*
> > * We updated vq->avail_idx so we need a memory barrier between
> > * the index read above and the caller reading avail ring entries.
> > --
> > MST
> >
next prev parent reply other threads:[~2026-03-02 15:12 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-03-02 8:51 [PATCH RFC] vhost: fix vhost_get_avail_idx for a non empty ring Michael S. Tsirkin
2026-03-02 14:30 ` Stefano Garzarella
2026-03-02 15:12 ` Michael S. Tsirkin [this message]
2026-03-03 6:38 ` Jason Wang
2026-03-22 9:43 ` ShuangYu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20260302101125-mutt-send-email-mst@kernel.org \
--to=mst@redhat.com \
--cc=eperezma@redhat.com \
--cc=jasowang@redhat.com \
--cc=kvm@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=sgarzare@redhat.com \
--cc=shuangyu@yunyoo.cc \
--cc=stefanha@redhat.com \
--cc=virtualization@lists.linux.dev \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox